AD-A245  328 


L/ 


/  - 


MATCHING  AND  VERTEX  PACKING:  HOW  “HARD”  ARE  THEY? 

by 


( 


Michael  D.  Plummer* 
Department  of  Mathematics 
Vanderbilt  University 
Nashville,  Tennessee  37235,  USA 


ABSTRACT 


Two  of  the  most  well-known  problems  in  graph  theory  are: 


(a)  Find  a  maximum  matching  (or  perfect  matching,  if  one  exists),  and 


(b)  Find  a  maximum  independent  set  of  vertices. 


The  first  problem — usually  called  the  matching  problem — is  known  to  have  a  poly¬ 
nomial  algorithm;  the  second — often  called  the  vertex  packing  problem — is  known  to  be 
NP-complete.  However,  many  graph  theorists— especially  those  who  do  not  deal  much 
with  complexity  of  algorithms — know  little  more  about  the  complexity  issues  associated 
with  these  two  problems  than  these  two  basic  facts. 

What  is  not  so  widely  known  within  the  graph  theory  community  is  that  these  two 
problems  have  motivated  a  great  deal  of  recent  activity  in  the  area  of  algorithms  and 
their  complexity.  Of  course  it  is  not  known  whether  or  not  P  =  NP,  but  most  workers 
in  the  area  currently  believe  that  equality  is  unlikely  to  hold.  Motivated  by  this  belief,  a 
number  of  people  have  studied  variations  of  both  matching  and  vertex  packing  with  the 
general  theme  being  two-fold.  On  the  one  hand,  one  cam  add  various  side  conditions  to  the 
matching  problem  and  study  the  complexity-both  sequential  and  parallel-of  the  resulting 
problems.  On  the  other  hand,  one  can  investigate  certain  large  and  interesting  classes  of 
graphs  trying  to  prove  that  for  these  classes  the  vertex  packing  problem  has  a  polynomial 
solution. 

Each  branch  of  this  two-pronged  attack  has  yielded  both  interesting  theorems  and 
perplexing  unsolved  problems.  This  paper  will  survey  this  work. 


1  Introduction  and  Background:  The  two  fundamental  Problems 

In  this  paper,  graphs  will  be  assumed  to  be  connected,  undirected  and  will  have  no 
multiple  edges  or  loops.  A  matching  is  any  set  of  independent  edges;  i.e.,  no  two  have 
a  vertex  in  common.  A  maximal  matching  is  a  matching  not  properly  contained  in  any 
other  matching.  A  maximum  matching  is  one  of  largest  cardinality.  A  perfect  matching 
(sometimes  called  a  1-factor)  in  a  graph  G  is  a  matching  which  covers  all  vertices  of  G. 
A  set  of  vertices  S  C  V{G)  is  independent  if  no  two  vertices  of  S  are  adjacent.  An 
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independent  set  S  is  maximal  if  it  is  not  a  proper  subset  of  any  other  independent  set 
and  maximum  if  it  is  aji  independent  set  of  largest  cardinality. 

So  fax,  then,  maximal  and  maximum  matchings  and  independent  sets  are  quite  analo¬ 
gous;  matchings  corresponding  to  sets  of  edges  and  independent  sets  corresponding  to  sets 
of  vertices.  To  be  sure,  they  are  related.  A  (maximal,  maximum)  matching  in  a  graph  G 
corresponds  to  a  (maximal,  maximum)  independent  set  in  the  line  graph  L{G).  But  we 
shall  soon  see  that  the  concepts  quickly  diverge  in  difficulty. 

First,  however,  let  us  note  that  matching  and  vertex  packing  remain  closely  related,  at 
least  in  the  computational  sense,  if  one  considers  the  class  of  bipartite  graphs.  Historically 
speaking,  it  was  this  class  of  graphs  which  was  first  studied  with  the  task  in  mind  of 
finding  maximum  matchings  and  independent  sets.  Implicit  in  the  early  work  of  Konig 
and  Egervary  were  the  roots  of  the  first  bipartite  matching  algorithms.  (For  much  more 
complete  histories  of  matching  algorithms  see  [l]  and  [2].)  Using  clcissical  alternating  path 
(or  slightly  more  efficiently,  alternating  tree)  arguments,  one  can  find  an  algorithm  to  find 
a  maximum  matching  in  a  bipartite  graph.  Moreover,  the  algorithm  provides  a  maximum 
matching  in  a  number  of  steps  polynomial  in  the  size  of  the  input  encoding  of  the  graph. 
(See  the  next  section  for  more  information  on  polynomial  and  other  types  of  complexity.) 

As  an  important  bonus,  however,  these  classical  bipartite  matching  algorithms  also 
yield  a  minimum  vertex  cover  of  the  graph;  i.e.,  a  smallest  set  of  vertices  which  col¬ 
lectively  contain  at  least  one  endvertex  of  every  edge  in  the  graph.  One  of  the  classical 
results  of  bipartite  matching  and  covering  due  to  Konig  [3,4]  says  that  the  size  of  any 
maximum  matching  equals  the  size  of  any  minimum  vertex  cover.  At  this  point,  one  need 
only  observe  the  simple,  but  important,  fact  that  the  complement  of  a  (minimum)  vertex 
cover  must  be  a  (maximum)  independent  set  of  vertices  (cf.  Gallai  [5])  and  presto!  We 
have  a  polynomial  algorithm  for  vertex  packing. 

For  graphs  which  are  not  bipartite,  however,  the  situation  changes  dramatically.  Com¬ 
putationally  the  two  roads  of  matching  and  vertex  packing  diverge  quickly.  We  will  follow 
the  route  of  matching  first. 

Although  the  (polynomial)  roots  of  matching  algorithms  for  bipartite  graphs  date  to 
the  1930’s  with  Konig-Egervary,  it  was  not  until  1965  that  the  first  polynomial  algorithm 
for  graphs  in  general  was  found  by  Edmonds  [6].  Edmonds  himself  gave  an  implementation 
in  0{n^)  time.  (As  we  write  this  paper,  Blum  [7,8,9]  claims  that  the  Edmonds’  implemen¬ 
tation  can  be  shortened  to  0{n^]  without  complications.)  Since  Edmonds’  original  paper, 
there  have  been  a  number  of  papers  successively  reducing  the  time  bound. 

For  the  past  ten  years  or  so,  Micali  and  V.V.  Vazirani  [10]  have  held  the  record  of 
0{y/nm)  time.  Interestingly,  the  first  proof  of  correctness  of  this  algorithm  did  not  appear 
until  1989  [ll[  and  is  some  forty  pages  in  length! 

It  is  interesting  to  note  that  again  xmtil  very  recently  the  bound  for  bipartite  graphs 
has  been  no  better  thaji  that  for  general  graphs,  although  Hopcroft  and  Karp  [12,13] 
attained  the  0{\/nm)  bound  some  years  eairlier  than  did  Micali  and  Vazirani.  This  bound 
stood  until  earlier  this  year  when  new  breakthroughs  in  bipartite  matching  were  made  by 
the  quartet  of  Alt,  Blum,  Mehlhorn  and  Paul,  and,  independently,  by  Feder  and  Motwani 
[l4l.  The  first  four  authors  [15]  improved  the  time  bound  to  0{n^'^ \/m/  log  n)  using  a 
new  “Tast  adjacency  matrix  scanning  technique”  due  to  Cheriyan,  Hagerup  and  Mehlhorn 


[16].  This  bound  improves  the  old  time  bound  by  a  factor  of  \/log  n  when  the  graph  is 
dense — i.e.,  when  the  number  of  edges  is  O(n^).  Feder  and  Motwani,  on  the  other  hand, 
developed  an  algorithm  which  is  even  faster.  Their  algorithm  finds  a  maximum  matching 
in  a  bipartite  graph  in  time  where  /c{n,m)  =  (log n)/[log(n^/m)]. 

But  now  let  us  return  to  the  fork  in  the  road  and  discuss  vertex  packing  for  graphs  in 
general.  The  best  deterministic  sequential  algorithm  known  to  the  author  is  0(2”/^)  due 
to  Tarjan  and  Trojcinowski  [17]  which  cleaxly  shows  the  large  gap  presently  extant  between 
matching  and  vertex  packing. 

Consider  the  special  case  when  G  is  the  line  graph  of  another  graph,  a  situation  which 
was  observed  above  to  guarantee  a  polynomial  algorithm  for  vertex  packing  in  G.  It  is 
well-known  that  line  graphs  have  a  characterization  in  terms  of  a  list  of  nine  forbidden 
induced  subgraphs  (cf.  Harary  [18]).  Of  these  nine  subgraphs,  perhaps  the  most  widely 
studied  is  the  claw  Ki^^.  A  graph  containing  no  induced  claw  is  said  to  be  claw-free.  In 
1980,  Minty  [19]  and  Sbihi  [20]  independently  proved  that  if  a  graph  is  claw-free,  then  the 
vertex  packing  problem  can  be  solved  in  polynomial  time. 

The  remainder  of  this  paper  will  be  organized  as  follows.  In  Section  2  we  present  a 
rather  intuitive  survey  of  the  complexity  classes  in  which  we  will  be  primarily  interested, 
together  with  an  illustration  showing  the  known  containment  relationships  among  the 
classes.  Section  3  contains  a  number  of  variations  on  matching  which  have  been  formulated 
and  the  status  of  their  complexity,  if  known.  Section  4  deals  with  vertex  packing  variants 
and  relations  in  much  the  same  way  that  Section  3  deals  with  matching.  Sections  5  and  6 
deal  with  matching  and  vertex  packing  in  parallel,  respectively.  Section  7  deals  with  the 
status  of  the  matching  and  vertex  packing  problems  involving  counting,  both  exact  and 
approximate.  Finally,  Section  8  is  a  short  introduction  to  the  work  on  lower  bounds  for 
the  complexity  of  matching  and  vertex  packing,  especially  that  of  Razborov. 

We  conclude  with  a  list  of  over  260  references. 


2.  Complexity  Classes 

So  far  we  have  spoken  only  about  algorithms  which  are  polynomial .  But  what  does 
this  mean  precisely?  What  kind  of  computational  devices  are  being  \ised?  Does  it  matter? 
What  is  “NP”  anyway?  In  order  not  to  get  bogged  down,  we  will,  for  the  most  paxt, 
unashamedly  sidestep  these  important  issues,  trying  instead  to  present  our  complexity 
results  at  a  more  intuitive  level.  Fortunately,  we  have  excellent  resources  to  fall  back  on. 
The  “bible”  of  complexity  theory  remains  the  book  of  Garey  and  Johnson  [2l],  together 
with  Johnson’s  ongoing  guide  in  the  Journal  of  Algorithms.  This  author,  himself  a  neophyte 
in  the  jungles  of  complexity  theory,  feels  compelled  to  admit  that  after  giving  the  conference 
talk  which  corresponded  to  a  rough  first  draft  of  this  paper,  soon  found  himself  spending 
nearly  a  year  in  the  subsequent  ferreting  out  of  such  things  as  what  a  “Monte  Carlo” 
algorithm  truly  is  (not  everyone  quite  agrees  on  the  definition,  it  seems!)  and,  most  of 
all,  trying  to  compile  the  table  of  complexity  classes  and  the  corresponding  lattice  of 
containments  shown  below.  With  immense  feelings  of  relief  (mixed  with  not  a  little  irony) 
after  we  had  suffered  through  most  of  this,  we  discovered  the  newly  published  “Handbook 
of  Theoretical  Computer  Science  Volume  A:  Algorithms  and  Complexity”.  Most  of  our 
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lattice  can  be  gleaned  from  the  collection  of  lattices  provided  by  Johnson  in  Chapter  2  of 
this  bock  [22].  In  general,  four  chapters  in  this  book  are  excellent  sources  for  other  facets  of 
our  survey.  We  refer  our  readers  to  Chapter  1  [23]  by  van  Emde  Boas  for  machine  models, 
Chapter  2  [22]  by  Johnson  on  complexity  classes  and  their  interrelationships,  Chapter  14 
by  Boppana  and  Sipser  [24]  for  the  Boolean  circuit  approach  to  complexity  and  to  Chapter 
17  [25,26]  by  Karp  ajid  Ramachandran  for  parallel  computation. 

Other  good  references  on  various  aspects  of  complexity  that  were  especially  helpful  to 
the  author  include  [27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50, 
51,52,53,54,55,56]. 

The  types  of  problems  of  concern  to  us  come  in  different  varieties,  (l)  decision 
problems — often  called  simply  “yes-no”  problems  (“Does  graph  G  have  a  perfect  match¬ 
ing?”);  (2)  search  problems  (“Given  graph  G,  find  a  maximum  independent  set  of  vertices 
in  G.”);  and  (3)  counting  problems  (“Given  a  graph  G,  compute  $(G),  the  number  of 
perfect  matchings  in  G.”). 

This  would  seem  to  be  an  appropriate  place  to  be  more  precise  about  decision  versions 
of  our  two  fundamental  problems.  In  particular  the  “yes-no”  variants  of  matching  and 
vertex  packing  which  we  shall  most  often  refer  to  are: 

(1)  Given  graph  G  and  k  >  0,  does  G  have  a  matching  of  size  >  kl 

(2)  Given  graph  G  and  A:>0,  does  G  have  an  independent  set  of  size  >  k? 

We  will  call  problem  (l)  the  matching  problem  and  denote  it  by  MATCH.  Problem 
(2)  will  be  called  the  vertex  packing  problem  and  will  be  referred  to  as  VP.  Sometimes 
others  have  been  known  to  call  the  perfect  matching  problem  by  our  term  MATCH,  but 
this  should  cause  us  no  problems  in  this  paper. 

Now . as  to  complexity  classes.  In  Section  1,  we  began  our  discussion  with  the 

matching  problem,  a  problem  known  to  have  a  polynomial  solution.  A  problem  is  in  class 
P  if  there  is  an  algorithm  to  solve  it  which  runs  in  time  polynomial  in  the  size  of  the  input 
function.  In  other  words,  the  algorithm  always  terminates  in  a  number  of  steps  polynomial 
in  input  size. 

Historically,  it  is  fair  to  say,  one  of  the  most  important  roots  of  computational  com¬ 
plexity  theory  is  the  concept  of  the  class  NP.  These  are  the  problems  which  can  be 
solved  in  non-deterministic  polynomial  time.  We  will  mostly  avoid  the  treatment  of  non- 
deterministic  machines  ajid  circuits  by  instead  explaining  NP  via  the  “certificate”  method. 
The  NP  machine  is  allowed  to  “guess”  a  solution  to  a  given  problem  (or  “consult  an  or¬ 
acle”),  but  then  must  provide  a  certificate  for  the  solution  which  can  be  checked  in  poly¬ 
nomial  time.  For  example,  the  problem  “Does  graph  G  contain  a  Hamilton  cycle?”  is  in 
class  NP,  for  given  an  ff-cycle  in  G,  one  can  convince  himself/herself  that  it  is  indeed  a 
Hamilton  cycle.  (Just  draw  it!  That  is,  output  it  as  an  alternating  sequence  of  vertices  and 
edges  and  then  check  the  sequence  to  be  sure  that  all  the  vertices  of  V(G),  except  exactly 
one,  appear  once,  and  this  one  exception  (the  “beginning”  and  “end”  vertex)  appears  ex¬ 
actly  twice.  Finally,  check  that  each  edge  in  the  sequence  joins  the  vertex  preceding  it  in 
the  sequence  to  the  vertex  succeeding  it.) 

It  is  crucial  to  understanding  the  concept  of  NP  that  we  realize  that  how  the  Hamilton 
cycle  was  found  in  the  first  place  is  irrelevant;  we  are  only  interested  in  certifying  in 


4 


polynomial  time  that  it  is  indeed  a  Hamilton  cycle. 

Now  what  if  the  input  graph  G  does  not  have  a  Hamilton  cycle?  No  “certificate” 
of  this  fact  in  the  above  sense  is  known.  The  opposite  “yes-no”  situation  occurs  for  the 
following  question:  “Is  every  maximal  independent  set  in  graph  G  of  the  same  size?”. 
(Such  graphs  axe  called  well-covered  or  w-c  in  short  and  were  introduced  in  [57].)  This 
time  if  the  answer  is  “no” ,  it  is  easy  to  certify.  Just  exhibit  two  maximal  independent  sets 
of  different  size.  Note  that  maximality  is  easy  to  check.  But  if  the  answer  is  “yes” ,  there 
is  no  known  certification. 

Define  the  class  co-NP  to  be  the  class  of  all  “yes-no”  problems  such  that  if  the  answer 
is  “no”,  it  can  be  certified  in  polynomial  time.  So  the  Hamilton  cycle  problem  is  in  NP 
and  the  well-covered  graph  problem  is  in  co-NP. 

Clearly,  P  C  NP  fl  co-NP.  Does  equality  hold?  Is  P  =  NP?  Is  NP  —  co-NP?  All  are 
open  questions. 

Next  let  us  try  to  formalize  the  idea  of  a  “hardest”  problem  in  a  complexity  class. 
We  will  be  content  to  illustrate  with  class  NP.  We  begin  with  the  notion  of  a  polynomial 
transformation.  If  tt  and  ir'  are  decision  problems,  a  polynomial  transformation  from 
TT  to  tt'  is  a  function  /  from  inputs  x  of  tt  to  inputs  /(i)  of  tt'  such  that  whenever  x  yields 
the  answer  “yes”  for  problem  tt,  /(x)  yields  the  answer  “yes”  for  problem  tt'.  Moreover, 
the  (deterministically-computable)  function  /(x)  can  be  computed  in  polynomial  time. 

The  complexity  theorist  would  hasten  to  add  that  the  concept  of  a  (polynomial) 
transformation  should  be  contrasted  with  the  more  general  notion  of  a  polynomial  (Turing) 
reduction  which  allows  multiple  calls  of  a  subroutine.  However,  we  shall  not  refer  to  the 
latter  type  of  reduction  here. 

A  problem  tt  is  NP-hard  if,  given  any  problem  tt'  in  NP,  there  is  a  polynomial 
transformation  of  every  instance  of  tt'  to  an  instance  of  problem  tt.  That  is,  if  we  can  solve 
TT  in  polynomial  time,  then  we  can  solve  tt'  polynomially  as  well.  If,  in  addition,  problem  tt 
belongs  to  NP,  we  say  that  tt  is  NP-complete.  That  is  to  say,  it  is  a  “hardest”  problem  in 
NP.  The  first  NP-complete  problem  was  found  by  Cook  [58]  in  1971.  It  is  the  satisfiability 
problem  of  logic  (or  “SAT”  for  short).  Suppose  formula  F  is  the  conjunction  of  a  set  of 
disjunctions  of  propositional  variables.  Is  there  an  assignment  of  trues  and  falses  to  the 
literals  making  F  true? 

Levin  [59]  (see  also  [60]  in  which  a  corrected  translation  of  Levin’s  article  appears 
as  an  appendix)  independently  and  essentially  simultaneously  arrived  at  the  notion  of 
NP-completeness. 

Shortly  after  Cook’s  ground-breaking  result,  Karp  [6l|  published  a  list  of  21  additional 
NP-complete  problems.  One  of  these  was  VERTEX  COVER  defined  as  follows.  “Given  a 
graph  G  and  an  integer  fc  >  0,  does  G  have  a  set  of  k  or  fewer  vertices  which  collectively 
touch  all  the  edges  of  G?”  Since  the  complement  of  a  vertex  cover  is  an  independent  set, 
VERTEX  COVER  is  clearly  equivalent  to  the  vertex  packing  problem  VP  defined  earlier. 

It  makes  sense  to  ask  if  there  are  “complete”  problems  for  other  complexity  classes 
as  well.  For  example,  for  class  P.  One  must  be  careful,  though,  to  carefully  define  the 
allowable  types  of  transformations.  For  example,  with  polynomial  transformations  allowed, 
every  problem  in  P  would  be  “complete”!  This,  then,  hardly  captures  a  useful  meaning 
for  a  “hardest”  problem  in  P.  In  the  case  of  class  P,  a  different  kind  of  transformation 
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is  employed — so-called  log-space  transformations.  In  terms  of  a  worktape  of  a  Turing 
machine,  a  log-space  transformation  is  one  that  is  computable  in  logarithmic  space  on 
the  worktape.  Note  that  these  transformations  are  also  polynomial  transformations  since 
a  logarithmic  workspace  bound  means  that  only  a  polynomial  number  of  distinct  states  on 
the  worktape  aje  possible. 

In  this  more  restricted  sense  of  log-space  transformation,  class  P  still  has  complete 
problems;  probably  the  most  famous  of  these  is  the  linear  programming  problem  [62]. 
There  is,  however,  another  P-complete  problem  closer  in  spirit  to  our  paper.  This  is 
LFMAXLVP — lexicographically  first  maximal  independent  set  [63,64].  Label  the  vertices 
in  the  graph  with  the  integers  1,2, ...  ,n  and  begin  to  build  a  maximal  independent  set  I 
by  putting  vertex  1  into  /,  deleting  {1}  U  iV(l),  where  N{1)  denotes  the  set  of  neighbors 
of  the  vertex  1.  Select  the  vertex  with  smallest  label  remaining  and  put  it  into  set  I. 
Delete  it  and  all  its  neighbors  and  continue  in  this  manner  until  all  vertices  have  either 
been  included  in  I  or  discarded.  The  “yes-no”  question  is  then:  “Is  vertex  n  in  set  11”. 
(Throughout  this  paper,  n  will  refer  to  the  number  of  vertices  in  a  graph  and  m  to  the 
number  of  edges,  unless  otherwise  stated.) 

Another  question  close  to  home  for  us  in  this  paper  is:  “Is  the  perfect  matching  prob¬ 
lem  P-complete?”  The  evidence  so  far  leads  most  workers  active  in  the  area  to  conjecture 
that  the  answer  is  “no”,  but  the  problem  remains  open.  (Cf.  [63,64].) 

With  these  basic  locations  on  our  lattice  behind  us,  let  us  continue  to  discuss  some 
additional  classes.  In  fact,  let  us  consider  parallel  computation  classes  next.  A  problem  is 
said  to  be  in  parallel  class  NC*  if  it  can  be  solved  in  0((logn)*)  time  with  a  polynomial 
number  of  processors.  Thus  i  is  the  degree  of  the  time-bounding  polynomial  in  logn.  The 
absence  of  an  index  on  the  processor  bound  in  naming  this  class  would  seem  to  support 
the  contention  that  “time  is  money”  and  “processors  are  cheap”!  To  be  fair,  it  should 
be  emphasized  that  many  complexity  theorists  concern  themselves  with  the  time  versus 
hardware  tradeoff  by  considering  the  product  of  the  time  and  processor  bounds  as  the 
“true”  measure  of  efficiency. 

The  class  NC  =  U^j^NC*  is  called  “Nick’s  class”  in  honor  of  its  first  proponent,  Nick 
Pippenger  [65]. 

We  shall  consider  our  parallel  processing  to  be  done  on  a  CREW  PRAM,  a  Con¬ 
current-Read  Exclusive  Write  Parallel  Random  Access  Machine.  In  such  a  device,  two 
processors  may  read  the  contents  of  the  same  memory  cell  at  the  same  time,  but  only  one 
at  a  time  may  write  in  such  a  cell.  Some  of  the  results  mentioned  later  in  the  sections 
devoted  to  parallel  problems  may  refer  to  the  more  restricted  family  of  EREW  P  RAM’s 
where  the  acronym  is  self-explanatory.  However,  the  differences  which  arise  between  these 
two  models  usually  mean  only  a  difference  in  the  “i”  part  of  NC‘  and  hence  no  difference 
in  whether  or  not  a  given  problem  is  in  class  NC.  For  this  reason  the  class  NC  is  often 
referred  to  as  being  “robust”. 

Although  we  will  have  much  less  to  say  about  it,  we  would  be  remiss  if  we  did  not  at 
least  mention  the  Boolean  circuit  model  of  parallel  computing.  In  this  model  the  class  NC’ 
is  defined  as  a  set  of  languages  (i.e.  subsets  of  {0,1}*)  recognizable  (i.e.,  “accepted”)  by 
classes  of  Boolean  circuits  having  a  polynomial  number  of  “and”,  “or”  and  “not”  gates  and 
having  depth  0((log  n)’).  (These  circuits  can  be  thought  of  as  directed  acyclic  graphs  with 
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certain  vertices  designated  as  inputs  (indegree  =  0)  and  a  special  output  vertex  (outdegree 
=  0).  The  depth  of  the  circuit  is  the  length  of  any  longest  path  from  an  input  to  the 
output  and  is  the  circuit  analog  of  “parallel  time”.  In  addition,  the  class  of  Boolean  circuits 
is  usually  assumed  to  be  log-space  uniform.  (A  class  of  Boolean  circuits  is  said  to 

be  log-space  uniform  if  there  is  a  deterministic  Turing  machine  which,  for  each  n,  can 

construct  circuit  Bn  in  space  C>(logn).  (See  [22]  and  [24].) 

1  0 

In  summary,  then,  we  have  NC  "  C  NC^  C  . . .  NC  C  P.  Whether  or  not  equality  holds 
at  any  stage  of  this  chain  is  unknown.  We  have  here  at  the  onset  still  another  unsettled 
question  about  matching.  Does  the  perfect  matching  problem  lie  in  NC? 

Let  us  next  give  a  brief  introduction  to  randomized  complexity  classes.  The  main 
motivation  here  is  the  following.  It  may  be  the  case  that  there  is  an  algorithm  for  a 
certain  problem  which  gives  the  correct  answer  most — but  not  all — of  the  time.  However, 
its  execution  time  is  faster  than  some  deterministic  alternative.  In  other  words,  it  may  be 
possible  (and  often  desirable)  to  trade  some  accuracy  for  speed. 

Let  us  begin  with  the  random  complexity  class  RP  (for  “random”  polynomial  time). 
These  are  the  problems  for  which  there  exist  algorithms  which  behave  as  follows.  If  the 
correct  answer  to  an  input  is  “yes” ,  the  algorithm  returns  the  answer  “yes”  with  probability 
>  p,  where  p  is  some  fixed  probability  bounded  away  from  0.  (Often  this  probability  is 
given  the  value  1/2  in  this  definition,  but  any  fixed  positive  value  will  do.)  However,  if  the 
correct  answer  is  “no” ,  the  algorithm  returns  the  incorrect  answer  “yes”  with  probability  = 
0.  Moreover,  it  does  this  in  polynomial  time.  Such  algorithms  are  said  to  have  “one-sided 
error”.  In  other  words,  if  the  algorithm  ever  answers  “yes”,  one  can  be  certain  that  “yes” 
is  indeed  the  correct  answer! 

The  reader  is  warned  that  some  authors  call  all  randomized  algorithms  Monte  Carlo, 
while  others  reserve  this  term  for  those  algorithms  with  one-sided  error,  like  those  in  class 
RP. 

Let  us  also  briefly  mention  a  class  containing  RP,  the  class  BPP.  (The  “B”  stands  for 
error  bounded  away  from  1/2.)  Members  of  this  class  are  those  problems  with  polynomial 
algorithms  which  return  the  correct  answer  (be  it  “yes”  or  “no”)  with  probability  >  1/2-f  e, 
for  some  e  >  0.  For  this  reason,  BPP  is  called  a  “two-sided  error”  clciss.  For  a  BPP  problem 
one  can  rerun  a  given  input  a  number  of  times  and  take  the  majority  answer.  The  more 
nms  the  more  probable  that  the  majority  answer  is  correct. 

Finally,  we  mention  the  randomized  complexity  class  ZPP.  (The  “Z”  stands  for  “zero 
error”.)  Here  the  algorithm  either  gives  the  correct  answer  (be  it  “yes”  or  “no”)  in  poly¬ 
nomial  time  or  it  refuses  to  answer  at  all!  Moreover,  the  probability  of  no  answer  at  all 
is  less  than  1/2.  Thus,  while  BPP  and  RP  algorithms  may  “lie”,  a  ZPP  algorithm  never 
does!  Randomized  algorithms  which  do  not  lie  are  called  Las  Vegas  algorithms,  a  term 
due  to  L.  Babai. 

There  are,  of  course,  parallel  analogs  to  all  of  these  randomized  classes.  Of  these 
we  shall  be  concerned  in  this  paper  only  with  RNC  (and  one  lonely  example  in  ZNC!). 
Clearly,  NC  C  RNC  by  definition,  but  whether  or  not  equality  holds  is  another  important 
open  question. 

Now  let  us  turn  to  a  problem  of  a  different  nature.  To  set  the  stage,  let  us  backtrack 
a  bit.  In  defining  all  the  complexity  classes  up  to  this  point,  we  have  dealt  exclusively  with 
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decision  problems;  i.e.,  problems  which  have  answers  which  are  either  “yes”  or  “no”.  Of 
course  there  are  other  types  of  problems  which  we  would  like  to  consider.  Search  problems, 
for  example,  are  especially  central  to  this  paper.  Examples  central  to  our  theme  are:  given 
a  graph  “find  the  size  of  a  maximum  matching”  or  “find  the  size  of  a  maximum  indepen¬ 
dent  set” .  In  more  extensive  treatments  of  complexity  (cf.  again  [22])  such  distinctions 
are  discxissed  at  length  aind  even  separate  complexity  classes  are  defined  to  reflect  these 
differences.  For  example,  class  FP  (“F”  is  for  “fimction”)  is  defined  as  the  search  analog 
of  decision  clatss  P.  But  even  more  often  in  the  existing  literature,  such  distinctions  are 
glossed  over.  In  the  interests  of  efficiency  in  attaining  the  goals  of  this  survey  paper  while 
the  reader  is  still  awake,  we  shall  also  ignore  such  distinctions  for  the  most  part. 

Having  said  that,  however,  we  next  treat  yet  a  third  type  of  problem  of  considerable 
interest  to  graph  theorists.  These  are  the  so-called  counting  problems.  To  be  sure,  such 
problems  are  “functional”,  but  the  output  in  this  case  is  a  non-negative  integer.  A  non- 
deterministic  Turing  machine  which,  given  an  input  string  i,  outputs  f{x)  =  the  number 
of  accepting  computations  for  this  input  string,  is  called  a  counting  Turing  machine.  The 
class  is  defined  to  be  the  set  of  all  functions  that  are  computable  by  polynomial  time 
counting  Turing  machines. 

It  is  perhaps  most  instructive  to  think  of  a  function  as  one  with  the  “magical” 
property  that  it  instantly  prints  out  the  number  of  acceptable  computations  of  an  associ¬ 
ated  polynomial  time  non-deterministic  Turing  machine. 

Some  familiar  examples  of  ^P  problems  are: 

(a)  Given  graph  G,  how  many  Hamilton  cycles  does  it  contain? 

(b)  Given  graph  G,  how  many  3-colorings  of  its  vertices  are  there? 

(c)  Given  graph  G,  how  many  perfect  matchings  does  it  contain? 

A  problem  is  #P-hard  if  there  are  polynomial  time  Turing  reductions  from  all  prob¬ 
lems  in  #P  to  the  given  problem.  If,  in  addition,  the  problem  belongs  to  the  class  as 
well,  then  we  say  that  the  problem  is  #P-complete. 

The  main  new  idea  is  that  for  ^P,  the  polynomial  transformations  from  problem 
to  problem  are  required  to  be  parsimonious,  that  is,  they  must  preserve  the  number  of 
solutions.  Valiant  [66,67]  showed  that  the  problem  of  the  number  of  perfect  matchings  in 
a  graph  (we  shall  call  it  :j^PM)  is  #P-complete  and  therefore  NP-hard,  even  when  the 
graph  is  bipartite.  This  problem  is  the  main  reason  for  introducing  the  class  #P  in  the 
present  paper. 

Strictly  speaking,  it  does  not  make  sense  to  ask  if  “NP  is  a  subclass  of  #P”,  since  the 
former  is  a  set  of  languages  (a  language  in  turn  is  just  a  set  of  strings),  but  ^P  is  a  set 
of  functions  from  strings  to  the  non-negative  integers.  Nevertheless,  in  our  Framework  we 
have  drawn  a  dashed  arrow  from  NPUco-NP  to  #P  to  indicate  that  problems  in  #P  are 
harder  by  their  very  nature  to  compute  then  are  those  in  NPUco-NP. 

For  example,  a  #P-machine  will  compute  the  number  of  Hamilton  cycles  in  a  graph 
G.  If  that  number  equals  0,  then  G  has  no  Hamilton  cycle,  but  if  that  number  is  greater 
than  0,  it  does.  Thus  we  have  an  instant  answer  to  a  proven  NP-complete  problem! 

Here  we  bring  a  halt  to  our  discussion  of  complexity  classes.  The  reader  likely  will 
wmt  to  refer  to  our  “Framework”  illustration  to  see  the  containment  relations  known  to 
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exist  (at  this  time  anyway)  among  the  various  classes.  (See  [22]  for  those  classes  we  do  not 
discuss  here.) 

Finally,  we  will  defer  definitions  and  discussion  of  the  two  classes  D*’  and  CC  until 
Sections  4  and  5  respectively  where  we  encounter  them  for  the  first  and  only  times  in  this 
paper. 

The  reader  will  note  that  we  have  drawn  a  horizontal  dashed  line  across  the  Framework 
diagram.  Those  classes  below  the  line  are  those  known  to  be  polynomial  time  computable 
(where  in  some  of  these  classes  recall  that  randomization  is  allowed). 


3.  Matching  Variations  and  their  Complexity 

We  now  give  a  list  of  some  variants  on  the  standard  matching  problem  and  give  their 
complexity,  if  known. 

Recall  that  the  matching  problems — both  “perfect”  and  “maximum”  varieties — are 
known  to  be  in  P.  An  even  simpler  problem  (at  least  sequentially)  is  the  problem  of  finding 
a  maxima/  matching.  The  greedy  algorithm  will  solve  this  nicely.  Choose  a  edge  and  delete 
its  endvertices.  Choose  a  edge  in  the  remaining  graph  and  delete  its  endvertices.  Con¬ 
tinuing  in  this  manner  we  have  a  trivially  polynomial  (sequential)  algorithm  for  maximal 
matching. 

It  is  perhaps  surprising  then  that  the  following  problem  (MINIMUM  MAXIMAL 
MATCHING)  is  NP-complete. 

1.  Given  graph  G  and  integer  A:  >  0,  does  G  have  a  maximal  matching  of  size  <  kl 

NP-completeness  was  first  shown  by  Yannakakis  and  Gavril  [68].  In  fact,  they  proved 
NP-completeness  even  in  the  cases  where  G  has  maximum  degree  at  most  3  and  is  either 
planar  or  bipartite.  The  same  two  authors  found  an  0[n)  algorithm  for  the  problem  in 
the  Ccise  when  the  graph  G  is  a  tree. 

Chronologically,  one  of  the  first  generalizations  of  matching  to  be  shown  to  be  NP- 
complete  was  the  S-dimtnsional  matching  problem  (3DM).  This  is  most  easily  described 
as  follows.  Let  H  he  a.  3-uniform  hypergraph  (i.e.,  each  “edge”  contains  three  vertices,  not 
two). 

2.  Given  H,  is  there  a  perfect  matching  of  hyperedges,  i.e.,  a  set  of  hyperedges  such  that 
each  vcrte.x  of  H  lies  in  exactly  one  hyperedge? 

This  problem  was  one  of  Karp’s  original  21  [61]  and  the  proof  is  by  reduction  from 
3SAT. 

The  NP-completeness  of  3DM  easily  implies  that  the  following  two  matching  problems 
are  NP-coraplete  (see  [69]). 

3.  Given  a  bipartite  graph  G  and  a  partition  E{G)  =  Ei  U  •  ■  •  U  Ek  of  its  edges,  does  G 
contain  a  perfect  matching  F  such  that  either  £,  C  F  or  fJ,  n  F  =  0,  for  all  :  =  1, . . . ,  kl 

4.  Given  a  bipartite  graph  G  and  a  coloring  of  its  edge  set  E[G),  does  G  contain  a  perfect 
matching  with  exactly  one  edge  of  each  color? 
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Motivated  by  the  fact  that  certain  timetable  and  image  analysis  problems  can  be  mod¬ 
eled  as  matching  problems  with  certain  additional  restrictions,  Itai,  Rodeh  and  Tanimoto 
[70j  introduced  the  RESTRICTED  MATCHING  problem. 

5.  Given  a  bipartite  graph  G,  a  collection  iEi, . . . ,  i?fc  of  subsets  of  E{G)  ajid  a  collection  of 
nonnegative  integers  ri, . . .  ,rfc,  does  G  have  a  perfect  matching  F  such  that  \F  Pi  i2,|  <  r, 
for  i  =  1, . . . ,  k? 

If  A:  is  set  equal  to  1  in  problem  5,  the  resulting  problem  is  equivalent  to  finding  a 
perfect  matching  using  as  few  edges  of  i2i  as  possible  and  this  is  a  very  special  case  of  the 
minimum  weight  perfect  matching  problem  which  was  shown  to  be  polynomially  solvable 
by  Edmonds  in  1965  [7l|. 

Note  that  since  k  is  unrestricted  in  problem  5,  problem  4  is  a  special  case  of  problem 
5  and  hence  the  latter  is  NP-complete  also.  However,  if  k  is  restricted  to  a  fixed  value, 
we  have  yet  another  problem,  this  one  introduced  by  Papadimitriou  and  Yannakakis  [72;. 
(See  also  [73].) 

6.  Let  the  positive  integer  k  have  a  fixed  value  (say,  10,  for  example).  Given  a  bipartite 
graph  G  with  its  edges  colored  in  k  colors  and  a  set  of  non-negative  integers  ci,...,cfc, 
does  G  contain  a  perfect  matching  F  such  that  F  contains  <  c,  edges  of  color  r,  for  all  :? 

The  difference  between  problem  6  and  problem  5  is  important  to  see,  although  it  is 
somewhat  subtle.  In  problem  6,  the  number  of  edge  classes  (i.e.,  “colors”)  is  fixed  or 
bounded;  in  problem  5,  k  is  not  bounded,  but  may  assume  values  as  large  as  one  likes  as 
part  of  the  input  string  to  the  problem. 

Papadimitriou  and  Yannakakis  [72]  showed  problem  6  to  be  polynomially  equivalent 
to  five  other  problems,  including  the  next  problem  in  our  list — EXACT  MATCHING. 

7.  Given  a  graph  G  and  a  set  of  distinguished  edges  R  C  E(G)  (call  these  edges  “red”) 
and  an  integer  A:  >  0,  is  there  a  perfect  matching  of  G  which  contains  exactly  k  red  edges? 

Note  that  EXACT  MATCHING  can  also  be  thought  of  as  the  special  case  when  k  =  2 
and  n  iZj  =  0  of  the  RESTRICTED  MATCHING  problem. 

Although  the  complexity  of  problems  6  and  7  remains  unknown,  even  for  bipartite 
graphs,  Barahona  and  Pulleyblank  [74]  have  shown  EXACT  MATCHING  to  be  polyno¬ 
mially  solvable  when  G  is  Pfaffian.  The  class  of  Pfaffian  graphs  is  discussed  in  [l]  and 
we  will  meet  them  again  in  Section  7  of  the  present  paper.  Suffice  it  to  say  here  that 
they  include  all  /Cs^s-free  graphs  and  hence  all  planar  graphs  as  well.  More  recently,  V.V. 
Vazirani  [75,76]  has  shown  EXACT  MATCHING  to  be  in  parallel  class  NC  for  ifs  3-free 
graphs.  We  shall  return  to  EXACT  MATCHING  in  Section  5. 

Given  a  matching  M  in  a  graph  G,  C  is  an  alternating  cycle  with  respect  to  M  if 
C  is  a  (necessarily  even)  cycle  in  which  every  second  edge  belongs  to  M.  M<.cching  M  is 
alternating  cycle  free  if  no  such  C  exists.  To  test  if  a  graph  G  has  an  alternating  cycle 
free  perfect  matching  can  be  done  in  polynomial  time,  for  clearly  G  has  a  cycle  free  perfect 
matching  if  and  only  if  G  has  a  unique  perfect  matching.  To  test  for  the  latter  property, 
let  \f  =  (ci, . . . ,  671/2}  be  any  perfect  matching  for  G  and  test  G  —  e,  for  each  i  in  turn  to 
see  if  it  contains  a  perfect  matching,  using  the  polynomial  algorithm  of  Edmonds  or  any 
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of  its  variants. 

On  the  other  hand,  consider  the  following  two  related  problems. 

8.  Does  a  bipartite  graph  G  have  a  perfect  matching  which  has  no  alternating  4-cycle? 

9.  Given  a  bipartite  graph  G  and  an  integer  k  >  Q,  does  G  have  an  alternating  cycle  free 
matching  of  size  at  least  fc? 

Pulleyblank  [77]  has  proved  both  these  problems  NP-complete  by  reducing  3SAT  to 
each.  This  work  was  done  in  connection  with  nainimizing  setups  in  precedence  constrained 
scheduling. 

Varying  our  demands  somewhat  yet  again,  let  us  call  a  matching  M  in  graph  G 
induced  if  no  two  edges  of  M  are  joined  by  a  edge  in  E{G)  —  M.  Consider  now  the 
induced  matching  problem. 

10.  Given  graph  G  and  an  integer  A:  >  0,  is  there  an  induced  matching  in  G  of  size  >  k? 

This  problem  has  been  proved  NP-complete  even  for  bipartite  graphs  by  Stockmeyer 
and  V.V.  Vazirani  [78]  and  later,  independently,  by  Cameron  [79].  (The  first  two  authors 
actually  showed  more.  Let  the  distance  between  two  edges  e  and  e'  be  the  length  of  a 
shortest  path  from  an  endvertex  of  e  to  an  endvertex  of  e'.  A  ^-separated  matching 
M  is  a  matching  in  which  the  distance  between  any  two  edges  is  at  least  6.  Stockmeyer 
and  Vazirani  showed  that  for  each  S  >  2  the  problem  “Given  a  graph  G  and  an  integer 
A:  >  0,  does  G  have  a  (5-separated  matching  of  size  at  least  k?”  to  be  NP-complete,  even 
for  bipartite  graphs  regular  of  degree  4.) 

On  the  other  hand,  Cameron  showed  maximum  induced  matching  (=  maximum  2- 
separated  matching)  is  polynomial — in  fact  in  NC — for  all  chordal  graphs.  (A  graph  is 
chordal  if  every  cycle  of  length  at  least  4  has  a  chord,  that  is,  a  edge  not  in  the  cycle,  but 
which  joins  two  vertices  of  the  cycle.) 

In  the  Stockmeyer  and  Vazirani  paper,  two  other  interesting  variations  are  shown  to 
be  NP-complete.  These  are  maximum  TR-matching  and  maximum  star  matching.  Both 
are  relevant  to  network  testing  of  various  sorts. 

A  T i2-matching  in  graph  G  is  a  pair  (M,  A)  where  M  is  a  matching  and  A  is  a 
labeling  function  which  assigns  to  each  vertex  of  G  one  of  the  three  labels  from  {T,  i2.  A}. 
Here  “T”  stands  for  “transmitter”,  “iZ”  for  “receiver”  and  “A”  for  “neither”.  In  addition, 
the  labeling  A  is  subject  to  the  following  conditions: 

(a)  A(v)  =  A  whenever  M  does  not  cover  vertex  v, 

(b)  if  edge  uv  £  M,  then  precisely  one  of  u  and  v  has  label  T,  the  other  R,  and 

(c)  if  uv  6  E(G)  —  M,  then  {A(u),A(u)}  ^  {R,T}. 

Condition  (c)  says  that  no  'ransmitter  is  connected  to  a  receiver  other  than  the  one 
to  which  it  is  matched  by  M.  The  size  of  a  TR  matching  is  the  cardinality  of  M. 

11.  Given  graph  G  and  integer  A:  >  0,  is  there  a  TiZ-matching  in  G  of  size  at  least  kl 

The  motivation  for  this  problem  is  reasonably  clear.  We  want  to  test  the  network  by  send¬ 
ing  a  signal  simultaneously  from  all  transmitters  (T’s)  to  their  receivers  (iZ’s).  Condition 

(c)  precludes  the  possibility  of  “jamming”;  that  is,  receivers  receiving  test  signals  from  two 
different  transmitters. 
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Star- matching  arises  from  a  different  testing  procedure.  A  star-matching  is  simply 
a  labeling  function  A  from  the  set  of  vertices  V (G)  into  the  set  {T,R}  such  that  for  each 
vertex  u  with  A(u)  =  R  there  exists  another  vertex  v  adjacent  to  u  such  that  A(t;)  =  T. 
The  size  of  a  star-matching  is  the  size  of  the  set  of  vertices  having  R  labels. 

12.  Given  graph  G  and  integer  A:  >  0,  does  G  have  a  star- matching  of  size  at  least  k? 

Problems  11  and  12  were  shown  to  be  NP-complete  for  bipartite  graphs  by  Even, 
Goldreich  and  Tong  [80].  Stockmeyer  and  Vazirani  show  them  to  be  NP-complete  for  all 
cubic  graphs. 

Returning  to  induced  matching  for  a  moment,  we  point  out  that  Stockmeyer  and 
Vazirani  refer  to  it  as  “risk-free  marriage”!  (A  moment’s  reflection  on  the  part  of  the 
reader  will  undoubtedly  reveal  why.)  This  leads  us  to  another  matching  variation  which 
has  been  widely  studied  over  the  past  twenty  years  or  so:  The  Stable  Marriage  Problem. 

Fortunately,  for  this  problem  we  have  a  quite  recent  and  comprehensive  survey  in  the 
form  of  the  book  of  Gusfield  and  Irving  [8l|. 

As  in  the  case  with  matching  in  general,  studies  here  naturely  divide  into  the  treatment 
of  the  bipartite  case  (the  Stable  Marriage  Problem)  and  the  general — i.e.,  not  necessarily 
bipartite— case  (the  Stable  Roommates  Problem). 

We  treat  the  bipartite  version  first.  Suppose  we  have  a  bipartite  graph  G  with  vertex 
bipartition  V {G)  =  AU  B  and  suppose  that  |A|  =  |5|.  Each  man  (member  of  A)  and 
each  woman  (member  of  B)  has  a  complete  list  of  preferences  for  the  member  of  the  set  of 
the  opposite  sex  which  is  a  strict  order  relation.  A  matching  M  is  said  to  be  unstable  if 
there  exists  a  man  and  woman  who  are  not  matched  by  M,  but  each  prefers  the  other  over 
his/her  partner  in  the  matching  M.  A  matching  is  said  to  be  stable  if  it  is  not  unstable. 
The  problem  then  is: 

13.  Given  a  bipartite  graph  as  above  with  ranked  preferences,  does  G  have  a  perfect 
matching  which  is  stable? 

Of  course  as  usual,  there  are  really  two  fundamental  related  problems  here.  First, 
does  a  stable  matching  exist  and  second,  if  so,  can  it  be  found  efficiently.  The  answer, 
fortunately,  is  “yes”  to  both  questions. 

In  1962,  Gale  and  Shapley  [82]  showed  that  a  stable  matching  always  exists  and 
provided  an  0(n‘)  algorithm  to  find  one. 

The  more  general  problem  of  Stable  Roommates  is  defined  in  a  manner  similar  to 
the  Stable  Marriage  Problem,  except  that  the  underlying  graph  G  need  not  be  bipartite 
(although  it  should  have  an  even  number  of  vertices)  and  each  vertex  (=  student)  has  a 
strict  ordering  of  all  other  vertices  (=  possible  roommates). 

14.  Given  a  graph  G  on  an  even  number  of  vertices  with  preference  lists  for  each  vertex, 
does  G  have  a  stable  matching? 

The  fundamental  difference  between  the  Marriage  and  Roommates  problems  turns 
out  to  be  that  the  latter  need  not  have  a  stable  matching. 

The  complexity  of  the  Stable  Roommates  Problem  was  settled  in  1985  by  Irving  [831 
who  found  an  O(n^)  algorithm  which  either  outputs  a  stable  matching  or  indicates  that 
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none  exists. 

These  two  problems  are  unique  in  this  paper  in  the  sense  that  it  is  known  that  the 
algorithms  are  asymptotically  optimal.  That  is,  it  has  been  proved  that  any  algorithm  for 
finding  a  stable  matching  (even  in  the  bipartite  case)  must  require  ■’t  least  cn^  time,  for 
some  constant  c  >  0.  This  was  proved  by  Ng  and  Hirschberg  in  1988  [84]. 

If  one  relaxes  the  demand  on  strict  preference  lists  to  allow  the  possibility  of  ties,  one 
must  agree  on  a  suitable  redefining  of  stability.  However,  for  two  of  the  “most  natural” 
such  definitions  (called  super-stable  and  strongly  stable  by  Gusfield  and  Irving),  the  Gale- 
Shapley  algorithm  can  be  extended  to  produce  a  stable  matching  (or  report  the  existence 
of  none)  in  polynomial  time  in  the  bipartite  case. 

However,  for  the  non-bipartite  Roommates  Problem,  if  ties  are  allowed,  the  problem 
becomes  NP-complete.  This  was  proved  by  Ronn  [85,86]. 

There  are  a  host  c.  variations  on  Stable  Marriage  and  Stable  Roommates,  some  in  P, 
some  NP-complete  and  some  the  complexity  of  which  is  unknown.  (See  the  Gusfield  and 
Irving  book  [81].) 

We  will  finish  our  treatment  with  one  more  such  variation.  Note  that  we  now  return 
to  the  original  requirement  that  the  preferences  be  strict.  For  a  roommates  matching  M, 
let  the  value  of  M  be 


v{M)  =  ^  (r(u,t;)  +  r(u,u)) 

uu6M 

where  r(u,v)  denotes  the  ranking  of  v  by  u.  The  Optimal  Roommates  Problem  is: 

15.  Given  a  graph  G,  compute  the  minimum  value  of  v{M)  taken  over  all  stable  matchings 
M. 

Feder  [87]  has  recently  proved  that  this  problem  is  NP-complete,  but  the  bipartite  (or 
Marriage)  version  is,  in  fact,  polynomial. 

We  conclude  by  reporting  that  the  problem  of  counting  stable  matchings,  even  in  the 
bipartite  case,  is  ^P-complete  [86]. 

Now  we  veer  in  a  different  direction.  Maximum  matching  can  be  thought  of  as  a 
“parking  problem”  in  which  one  wants  to  find  the  largest  number  of  edges  which  are 
mutually  vertex-disjoint.  Suppose  now  we  attempt  to  generalize  the  notion  of  “edge”  in 
the  packing  to  another  type  of  subgraph  Let  us  call  the  following  the  H-matching  problem 
and  denote  it  by  HMATCH. 

Given  graphs  G  and  H  is  there  a  spanning  subgraph  of  G  consisting  of  vertex-disjoint 
copies  of  HI 

Of  course  if  H  —  K2,  we  just  have  the  perfect  matching  problem  and  the  solution  is 
polynomial.  But  if  H  has  any  component  having  3  or  more  vertices,  Kirkpatrick  and  Hell 
[88,89]  proved  that  the  problem  is  NP-complete.  (See  also  [90,91].)  In  all  other  cases,  the 
problem  is  polynomial. 

We  can  modify  this  problem  in  yet  another  way.  Instead  of  one  fixed  graph  H ,  as  our 
subgraph  to  be  packed,  let  us  allow  a  choice  of  graphs  from  a  certain  specified  family.  Let 
U  =  {Hi,  H2, . . . , }  be  a  finite  or  infinite  family  of  graphs.  An  ^/-factor  for  G  is  then  a 
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vertex-disjoint  collection  of  subgraphs  of  G  which  together  cover  V[G)  and  each  member 
of  the  collection  comes  from  M.  For  example,  if  ^  =  {C3,  C4, . . the  family  of  all  cycles, 
then  an  ^-factor  for  G  just  becomes  a  2-factor. 

Hell  and  Kirkpatrick  study  a  variety  of  possibilities  for  M .  For  example,  if  ^1  is  the 
set  of  all  cycles  just  mentioned,  the  problem  is  polynomial.  If  I/2  =  {-^^2}  U  I/i,  the  packing 
sought  is  called  a  perfect  2-matching  and  again  the  problem  is  polynomial. 

If  ^3  =  {Ki,n\n  >  3}  (i.e.,  the  family  of  “s'  tjs”),  the  problem  is  NP-complete.  On 
the  other  hand,  if  we  add  to  ^3  either  Ki  or  (or  both),  and  call  the  resulting  class  ^4, 
the  problem  clearly  becomes  polynomial.  A  close  relative  to  this  problem  is  obtained  as 
follows.  Let  1/5  be  any  subset  of  {Ki^n\n  >  1}  with  the  property  that  for  some  t,  K\^t  ^  ^5, 
but  G  ■Vs-  Then  the  I/5  problem  is  NP-complete. 

One  final  example  is  obtained  by  letting  "Hq  be  any  family  of  complete  graphs.  Then 
the  I/e  packing  problem  is  polynomial  if  Ky  or  is  in  I/e  and  NP-complete  in  all 
other  cases.  Proofs  of  all  these  results,  cis  well  as  other  related  results,  may  be  found 
in  [92,93,90,94,95,91,96,97]. 

More  recently,  the  /f-matching  problem  has  been  investigated  for  planar  graphs  [98]. 
(Here  again,  let  us  note,  H  is  a.  single  fijced  graph.)  If  H  =  K3  or  FsTi^s  (the  “claw”).  Dyer 
and  Frieze  [99]  showed  HMATCH  to  be  NP-complete  in  the  plane.  Even  more  recently, 
Berman  et  al.  [98]  have  shown  that  if  H  has  at  least  3  vertices,  then  maximum  planar 
matching  version  of  HMATCH  is  NP-complete.  Surprisingly,  perfect  planeu-  matching 
version  of  HMATCH  is  another  story!  If  H  has  at  least  3  vertices  and  is  connected  and 
outerplanar ,  they  show  that  the  problem  is  NP-complete.  On  the  other  hand,  if  If  is  a 
triangulated  graph  with  at  least  4  vertices,  there  is  an  0{n)  algorithm  for  the  problem. 

A  characterization  of  those  H's  for  which  perfect  planar  HMATCH  is  polynomial 
remains  unknown.  An  approximation  algorithm  for  planar  iif-matching  is  obtained  in 
1100], 

Now  let  us  look  at  a  quite  different  problem  associated  with  matching.  Edmonds 
showed  that  that  matching  can  be  formulated  zis  a  linear  program.  There  now  exist 
several  polynomial  LP  algorithms  such  as  the  ellipsoid  method  and  Karmarkar’s  method. 
The  problem  with  using  these  directly  is  that  there  exist  an  exponentially  large  number 
of  inequalities  which  must  be  furnished  in  order  to  formulate  the  matching  problem  as  an 
LP. 

However,  the  ellipsoid  method  has  the  property  that  if  the  inequalities  can  be  fed  to 
it  as  needed,  it  can  solve  the  LP  in  polynomial  time.  Padberg  and  Rao  [lOl]  described 
an  algorithm  to  do  this  for  the  matching  polyhedron.  Thus  their  algorithm,  when  used 
in  conjunction  with  the  ellipsoid  method,  provides  another  method  for  solving  matching 
problems  in  polynomial  time. 

But  can  one  somehow  solve  the  matching  problem  in  polynomial  time  using  a  more 
standard  LP  algorithm  like  the  aforementioned  simplex  method  or  Karmarkar’s  algorithm? 
This  remains  an  open  question. 

Barahona  [102]  has  made  some  progress  on  this  question  by  showing  that  one  can 
solve  matching  via  a  polynomial  number  of  LP’s,  each  polynomial  in  size.  He  has  also 
shown  that  in  the  case  when  G  is  planar,  this  polynomial  number  of  polynomial  size  LP’s 
can  indeed  be  reduced  to  only  one  [103].  It  should  be  mentioned  in  this  connection  that  a 
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planar  graph  can  indeed  have  an  exponential  number  of  facets  in  its  matching  polytope. 
(Cf.  Gamble  [104j.) 

On  the  negative  side,  however,  Yannakakis  [105]  has  shown  that  no  symmetric  LP 
formulation  of  polynomial  size  for  matching  on  the  complete  graph  K2n  is  possible.  (A 
formulation  of  the  matching  problem  is  symmetric  if  any  extra  variables  and  the  roles  they 
play  are  independent  (up  to  permutation)  of  the  order  in  which  the  graph  is  examined.) 

To  close  this  section  of  our  paper,  we  make  a  fleeting  visit  to  the  land  of  matroids.  For 
details  the  reader  may  refer  to  [l]  or  [106],  or  to  many  other  reference  sources  for  matroid 
theory.  Suppose  Ai  =  {^,1)  is  a  matroid  where  E  is  the  ground  set  and  /  is  the  family 
of  independent  subsets  of  E.  Suppose  F"  is  a  pairing  of  all  the  elements  of  the  ground  set 
E.  A  set  A  C  F  is  a  parity  set  if  for  every  element  e  6  A,  its  mate  under  F  is  also  in  A. 
Then  the  MATROID  PARITY  problem  is: 

Given  M  =  {E,I),  a  pairing  F  and  an  integer  A:  >  0,  is  there  a  parity  set  A  in  M  of 
size  at  least  k? 

MATROID  PARITY  is  provably  exponential  and  this  result  does  not  depend  upon 
the  assumption  that  P  NP!  This  was  shown  by  Lovasz  in  1978  [107].  If  the  matroid 
is  linear,  however,  he  gave  a  polynomial  algorithm,  albeit  relatively  slow.  More  recently, 
faster  algorithms  have  been  found  [108]. 


4.  Vertex  Packing  Variations  and  their  Complexity 

As  mentioned  earlier  in  this  paper,  the  vertex  cover  problem  is  one  of  the  original 
NP-complete  problems  in  the  list  of  Karp  [61].  Later  it  was  proved  NP-complete  even 
when  restricted  to  cubic  planar  graphs  [2l],  to  triangle-free  graphs  [109]  and  several  other 
families  listed  in  [21,110].  Although  vertex  packing  is  polynomial  for  Ki^s-free  graphs  (see 
Minty  [19]  and  Sbihi  [20]  mentioned  earlier),  it  is  still  NP-complete  for  K’i_4-free  graphs 
[19]!  (See  also  [ill].)  For  even  more  classes  for  which  vertex  cover  is  NP-complete,  see 
also  MaJiadev  [112]. 

Recall  that  earlier,  along  with  claw-free  graphs,  we  also  mentioned  line  graphs  and 
bipartite  graphs  as  classes  for  which  VP  is  polynomial.  There  are  several  other  classes  for 
which  polynomiality  for  vertex  packing  has  been  shown,  but  too  recently  to  appear  in  the 
Garey  and  Johnson  book. 

The  most  famous  of  these  is  surely  the  family  of  perfect  graphs.  Let  w(G)  denote  the 
size  of  any  largest  complete  subgraph  in  G.  This  is  called  the  clique  number  of  G.  Clearly, 
<  x(G)  for  graph  G,  where  x(^^)  denotes  the  chromatic  number  of  G.  Graph 
G  is  said  to  be  perfect  if  w(G')  =  x(<^0  f°r  every  induced  subgraph  G'  of  G.  The  concept 
of  perfection  is  due  to  Berge  [113]  in  the  early  1960’s  and  the  class  of  perfect  graphs  is  now 
known  to  include  many  other  well-known  families  such  as  bipartite  graphs,  line  graphs  of 
bipartite  graphs,  interval  graphs,  comparability  graphs  and  triangulated  graphs — as  well 
as  the  complements  of  all  such  graphs.  In  fact,  Lovasz,  in  a  celebrated  1972  result  [114], 
proved  that  for  any  perfect  graph  the  complement  must  also  be  perfect.  (For  general 
references  on  perfect  graphs,  we  recommend  [115],  [116]  as  well  as  [117,118,119].) 
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It  is  a  highly  non-trivial  fact,  proved  by  Grotschel,  Lovasz  and  Schrijver,  that  ver¬ 
tex  packing  is  polynomial  for  the  clciss  of  all  perfect  graphs.  The  proof  uses  the  ellipsoid 
method,  the  first  polynomial  LP  algorithm  to  be  discovered.  For  a  thorough  account  of 
this  see  [117,118,120,121,122].  In  fact,  they  prove  polynomiality  for  a  larger  class,  namely 
the  so-called  h-perfect  graphs.  The  h-perfect  graphs  are  defined  as  those  graphs  the  stable 
set  polytopes  of  which  are  defined  by  certain  families  of  linear  inequalities — in  this  case 
the  so-called  non-negativity  constraints,  clique  constraints  and  odd  cycle  constraints.  Un¬ 
fortunately,  no  purely  graph-theoretical  characterization  of  h-perfect  graphs  is  yet  known. 

One  last  remark  about  perfect  graphs  is  in  order.  Although  the  complexity  of  showing 
a  graph  to  be  perfect  is  not  known,  showing  imperfection  is  in  class  NP.  (According  to  Berge 
and  Chvatal  [116],  this  result  is  attributable  to  Edmonds  and  Cameron.)  For  additional 
work  along  these  lines,  see  references  [123,124]. 

Now  let  us  return  to  bipartite  graphs  and  recall  the  fundamental  result  of  Konig  which 
says  that  ^{G)  =  r(G)  where  u{G)  is  the  size  of  any  maximum  matching  and  t{G)  is  the 
size  of  any  minimiun  vertex  cover  [3,4].  This  is  an  archetypal  example  of  a  “minimax” 
theorem  in  graph  theory.  Such  theorems  have  grown  in  importance  since  the  discoveries 
of  the  last  twenty  years  or  so  indicating  that  graph  theory  and  linear  programming  can 
profitably  be  brought  together.  (See  [l]  for  just  one  of  many  references  on  this  subject. 
See  also  [2].) 

It  is  important  to  realize  that  the  above  minimax  equation  holds  for  some  non-bipartite 
graphs  as  well;  for  example,  consider  the  4-vertex  graph  obtained  by  attaching  a  pendant 
edge  to  a  triang'e.  Graphs  satisfying  the  minimax  equation  are  said  to  have  the  Konig 
Property.  First  of  all,  there  are  several  characterizations  of  these  graphs  which  lead  to 
polynomial  recognition  algorithms  for  the  graphs  in  the  class.  (See  [125,126,127,1].)  Triv¬ 
ially,  these  graphs  have  polynomial  algorithms  to  find  the  size  of  a  maximum  independent 
set  via  the  matching  algorithm.  However,  none  of  the  above  references  explicitly  gives 
a  polynomial  algorithm  for  finding  a  maximum  independent  set — i.e.,  the  search  prob¬ 
lem.  However,  in  [127],  there  is  a  polynomial  algorithm  for  the  search  problem  implicit  in 
Lemma  3.3.  This  result  depends  upon  the  so-called  Gallai-Edmonds  decomposition  of  a 
graph,  a  canonical  decomposition  of  graphs  in  terms  of  their  meiximum  matchings.  This 
decomposition  can  be  found  in  polynomial  time  via  Edmonds’  algorithm.  The  details  may 
be  found  in  [l].  Also  see  the  thesis  of  Korach  [128]. 

It  turns  out,  however,  that  a  polynomial  algorithm  for  finding  a  maximum  independent 
set  in  a  graph  with  the  Konig  Property  has  been  around  quite  a  bit  longer  than  the  method 
referred  to  in  [l].  Define  a  2-cover  of  a  graph  G  to  be  an  assignment  of  weights  0,  1  and 
2  to  the  vertices  of  G  such  that  the  sum  of  weights  of  the  two  endvertices  of  any  edge  is 
at  least  2.  The  sum  of  all  weights  is  called  the  size  of  the  2-cover.  The  minimum  size  of 
any  2-cover  of  G  is  denoted  by  T2{G).  It  can  be  shown  (see  Corollary  6.3.4  of  [l|)  that  a 
graph  G  has  the  Konig  Property  if  and  only  if  it  satisfies  T2{G)  =  2t[G).  In  a  1975  paper, 
Nemhaxiser  ajid  Trotter  [129]  gave  a  polynomial  algorithm  which,  when  applied  to  an 
arbitrary  graph,  either  produces  a  maximum  independent  set  or  shows  that  T2{G)  <  2r(G) 
and  thus  that  the  graph  does  not  have  the  not  Konig  Property.  (This  was  before  the  name 
“Konig  Property”  had  yet  been  coined,  however.)  For  further  discussion  on  this  approach, 
the  reader  is  referred  to  [130]  and  [131]. 
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Another  class  of  graphs  with  polynomial  algorithms  for  VP  are  the  well-covered  graphs 
mentioned  earlier.  Recall  that  a  graph  is  well-covered  if  every  majcimal  independent  set  is 
in  fact  maximum.  In  other  words,  these  are  the  graphs  for  which  the  greedy  algorithm  for  a 
maximal  independent  set  always  results  in  a  maximum  independent  set.  The  polynomiality 
for  VP  for  these  graphs  is  thus  trivial.  What  is  not  trivial  about  well-covered  graphs  is 
recognizing  them! 

A  moment’s  reflection  will  tell  the  reader  that  well-covered  graphs  are  in  co-NP.  In 
fact,  very  recently  it  has  been  shown  that  in  fact  well-covered  graph  recognition  is  co-NP- 
complete.  (See  [132,133].)  But  membership  in  NP  is  unsettled.  In  fact,  it  is  thought  by 
most  that  membership  in  NP  is  highly  unlikely,  for  such  a  result  would  imply  that  NP  = 
co-NP,  a  situation  considered  almost  as  unlikely  as  P  =  NP! 

The  concept  of  a  well-covered  graph  was  introduced  in  [57].  In  the  past  five  years  or 
so,  there  has  been  a  surge  of  interest  in  this  graph  class  and  the  interested  reader  may 
wcint  to  consult  the  forthcoming  survey  [134].  See  also  [135,133,132]. 

Next  we  would  like  to  briefly  discuss  the  so-called  a-critical  graphs.  Let  the  indepen¬ 
dence  number  of  graph  G  (i.e.,  the  size  of  any  largest  independent  set  in  G)  be  denoted 
by  a{G).  A  graph  G  is  said  to  be  a-critical  if  a(G  —  e)  >  a(G),  for  all  edges  e  G  E[G). 
It  may  well  be  that  the  recognition  problem  for  these  graphs  belongs  to  neither  NP  nor  to 
co-NP!  At  this  point,  no  one  knows. 

If  these  graphs  were  in  NP,  then  one  could  give  a  “good  characterization”  of  a(G)  for 
any  graph.  (In  other  words,  one  could  show  that  determining  q:(G)  belongs  to  NPnco  —  NP 
|123|.) 

There  has  been  quite  a  lot  of  Interest  in  obtaining  structural  properties  of  this  family 
of  graphs.  For  an  overview  of  the  structural  results  presently  known  for  these  graphs,  see 
[11- 

In  1965,  Hajnal  [136]  proved  that  in  any  a-critical  graph  G,  maxdeg  v  <  IV'IG)!  - 
2a(G)  -I-  1.  Denote  the  quantity  1V’{G)|  —  2a(G)  by  5(G).  Gallai  suggested  that  studying 
connected  a-critical  graphs  G  by  means  of  the  value  of  5(G)  might  prove  profitable.  The 
parameter  5(G)  has  come  to  be  called  the  Gallai  class  number  of  G  for  this  reason. 
Cleeirly  the  only  connected  a-critical  graph  with  5  =  0  is  K'2.  It  is  also  easy  to  see  that 
the  only  connected  a-critical  graphs  having  5(G)  =  1  are  the  odd  cycles.  From  this  point 
on,  the  situation  rapidly  becomes  more  complex. 

Andrasfai  [137]  showed  that  the  only  connected  a-critical  graphs  with  a(G)  =  2  are 
the  even  subdivisions  of  K4.  A  deep  result  due  to  Lovasz  some  eleven  years  later  [121] 
showed  that  for  any  value  of  5,  the  class  of  a-critical  graphs  G  having  5  =  5(G)  must 
arise  from  a  finite  class  of  “basis”  graphs  via  even  subdivisions.  For  5  =  3,  it  has  been 
shown  that  there  are  precisely  four  basis  graphs,  while  for  5  >  4,  it  is  known  only  that  the 
number  of  basis  graphs  is  bounded  above  by  a  rather  complicated  function  exponential  in 
5. 

Returning  to  Andrasfai’s  result  for  a  moment,  let  us  compare  it  with  a  result  of  Chvatal 
[138]  which  says  that  every  a-critical  graph  G  with  5(G)  >  2  must  contain  a  subdivision  of 
K4.  Moreover,  Chvatal  also  showed  that  any  graph  which  does  not  contain  a  subdivision 
of  K4  (i.e.,  a  so-called  series-parallel  graph),  has  a  polynomial  algorithm  for  VP. 

Chvatal  also  conjectured  that  all  a-critical  graphs  with  5  >  2  must  in  fact  contain  an 
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even  subdivision  of  K4.  This  conjecture  was  settled  in  the  affirmative  by  Sewell  [139,140j 
and  Sewell  and  Trotter  [141]  in  1990.  Moreover,  in  [139,140],  Sewell  also  showed  that 
if  graph  G  does  not  contain  an  even  subdivision  of  then  the  VP  problem  for  G  is 
polynomial. 

Thus  we  have  a  kind  of  “separation”  result  in  that  we  have  a  class  of  graphs  which 
are  not  a-critical,  but  which  have  a  polynomial  VP  algorithm.  It  is  interesting  that  the 
polynomial  VP  algorithm  of  Sewell  uses  the  ellipsoid  method  also.  It  is  an  open  question 
as  to  whether  use  of  the  ellipsoid  method  can  be  avoided  in  this  case. 

One  final  remark  is  in  order  regarding  a-critical  graphs.  We  stated  earlier  that  it 
may  well  be  that  recognizing  these  graphs  is  neither  in  NP  nor  in  co-NP.  What  then,  if 
anything,  can  be  said  about  the  complexity  of  this  problem? 

In  1982,  Papadimitriou  and  Yannakakis  [142],  in  an  attempt  to  classify  the  complexity 
of  facets  of  the  polytope  which  arises  in  the  LP  formulation  of  the  traveling  salesman 
problem,  defined  a  new  complexity  class  D^.  (See  also  [73,143,50].)  In  terms  of  languages 
(i.e.,  sets  of  binary  strings),  is  defined  as  the  set  of  all  languages  L\  n  where  Li  is 
in  NP  and  L2  is  in  co-NP.  In  terms  of  problems,  we  perhaps  can  best  illustrate  this  class 
with  the  example  of  EXACT  VERTEX  PACKING: 

Given  a  graph  G  and  an  integer  A:  >  0,  is  a(G)  =  kl 

This  can  be  thought  of  as  the  intersection  of  two  problems.  The  first  is  our  old  friend 
VP  which  says  “Given  G  and  k  >  0  does  G  have  an  independent  set  of  size  at  least  A:?”. 
This  is  in  NP  as  we  have  seen  earlier.  As  the  second  problem,  consider:  “Given  G  and 
A:  >  0,  does  G  not  have  an  independent  set  of  size  at  least  k  +  1?”.  In  the  second  problem, 
if  the  answer  is  “no” ,  it  is  easily  certifiable  by  giving  an  independent  set  which  has  size  at 
least  A:  -t-  1. 

It  follows  by  definition  that  NPUco  —  NP  C  D^.  It  should  be  emphasized  that  there  is 
an  important  distinction  between  a  class  defined  in  terms  of  two  problems,  one  in  NP,  the 
other  in  co-NP  and  a  class  defined  in  terms  of  a  single  problem  simultaneously  belonging 
to  NP  and  to  co-NP.  D'’  is  an  example  of  the  former;  NP  n  co  —  NP  is  an  example  of  the 
latter. 

It  turns  out  that  not  only  does  the  problem  of  determining  the  facets  of  the  VP 
polytope  lie  in  this  new  class,  but  so  does  EXACT  VERTEX  PACKING.  In  fact,  both 
are  D^’-complete!  (See  [73,142].)  Somewhat  later  in  1985,  Papadimitriou  and  Wolfe  [l43] 
announced  that  V.V.  Vazirani  has  shown  that  the  recognition  problem  for  a-critical  graphs 
is  also  D^-complete,  although  we  are  unaware  of  a  published  proof  to  date. 

We  conclude  this  section  with  one  brief  remark  on  approximating  a(G).  Some  time 
ago,  Garey  and  Johnson  [21]  proved  that  in  a  sense  this  is  a  hopeless  task.  More  specifically, 
they  showed  that  if  one  could  find  a  polynomial  time  algorithm  which,  given  an  e  >  0, 
outputs  an  approximate  value  oc*(G)  in  the  sense  that  |a(G)  —  a*(G)|  <  e,  then  in  fact 
there  miist  be  a  polynomial  algorithm  for  MAXMVP  and  hence  P  =  NP! 

There  are,  of  course,  other  measures  of  “approximation”.  For  example,  suppose  a*(G) 
is  an  estimate  for  Q!(G).  One  could  consider  the  ratio  a(G)/a*(G)  as  a  measure  of  “good¬ 
ness”  of  approximation.  But  for  this  case  as  well,  the  news  is  not  good.  There  is  no  known 
algorithm  for  FINDVP  which  guarantees  a  ratio  any  better  than  O(n^),  where  as  usual  n 
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is  the  input  size  and  e  >  0. 

Some  recent  results  by  Feige,  Lovasz,  Goldwasser,  Safra  and  Szegedy  [144,145,146! 
are  of  special  interest  here.  These  authors  show  that  if  one  could  approximate  a(G)  in 
polynomial  time  to  within  a  factor  of  then  NP  C  QP,  where  QP  denotes  quasi¬ 

polynomial  time',  that  is,  0(2'°®”’').  This  set  inclusion  would  then  imply  NEXPTIME  = 
EXPTEME.  (This  equality  would  also  be  implied  by  P  =  NP.)  It  is  also  shown  that  if  one 
can  approximate  the  independence  number  within  a  constant  factor  in  polynomial  time, 
then  in  fact  every  NP  problem  can  be  solved  in  time. 

The  proofs  employ  techniques  of  interactive  proof  systems.  (See  the  “IP”  at  the  top 
of  our  complexity  clciss  diagram.) 

But  lest  we  leave  the  reader  on  too  negative  a  note,  it  has  been  shown  by  Boppana 
and  Halldorsson  [147]  that  there  is  a  polynomial  time  algorithm  which  will  approximate 
a(G)  within  a  factor  of  n/  log^  n. 

In  view  of  the  difficulties  encountered  in  approximating  the  size  of  a  maximum  inde¬ 
pendent  set,  one  may  find  the  following  surprising,  due  to  the  complementary  connection 
between  independent  sets  and  vertex  covers.  Gavril  [21]  first  observed  in  1974  that  there 
is  a  straightforward  polynomial  approximation  algorithm  which  for  any  graph  G  sup¬ 
plies  a  number  r*(G)  such  that  r*(G)/r(G)  <  2!  Remember  that  by  Gallai’s  result  [5l, 
r(G)  -f-  a{G)  =  |V(G)|.  Now  simply  construct  any  maximal  matching  M,  say  of  size  k. 
Then  the  set  of  endvertices  C  of  M  is  a  set  of  2k  vertices  and  by  the  maximality  of  M,  C 
is  a  vertex  cover.  Now  every  cover  of  G  must  in  particular  cover  M,  so  t(G)  >  k.  Thus 
we  have  2r(G)  >  2k  =  |C]  and  hence  \C\/t{G)  <2  as  claimed. 

This  ratio  of  2  was  improved  to  a  factor  of  2  —  fl((log  log  n)/ log  n)  by  Bar- Yehuda 
and  Even  in  1983  [148]  and  independently  by  Monien  and  Speckenmeyer  in  1985  [149]. 


5.  Matching  in  Parallel 

It  is  a  good  starting  point  for  this  section  to  recall  that  without  any  of  the  various 
additional  “bell  and  whistle”  conditions  discussed  in  Section  3,  the  problem  of  finding 
a  maximum  matching  for  general  graphs  can  be  done  in  polynomial  time  via  Edmonds 
algorithm  or  any  of  its  descendants.  Of  course  these  algorithms  are  all  sequential.  Note 
also  that  (a)  finding  a  perfect  matching  (or  showing  none  exists)  and  (b)  finding  a  maxima/ 
matching  are  both  sequentially  polynomial  as  well.  Polynomiality  of  (a)  follows  from 
Edmonds  algorithm  and  polynomiality  of  (b)  is  essentially  trivial  by  the  greedy  algorithm. 

The  situation  is  quite  different  when  one  moves  to  a  parallel  setting.  It  is  not  at  all 
clear  how  to  efficiently  parallelize  even  greedy  matching,  let  alone  Edmonds’  algorithm.  In 
fact,  perhaps  the  outstanding  open  question  regarding  matching  in  parallel  is  whether  or 
not  maximum  matching  (or  perfect  matching,  for  that  matter)  belongs  to  NC.  (There  aie 
certainly  parallel  algorithms  for  maximum  matching,  however.  See  [150,151,152,153,154]. 
They  simply  are  not  polylog  time  algorithms.) 

But  let  us  begin  with  the  (apparently)  even  simpler  problem  of  finding  a  maxima/ 
matching.  Let  us  denote  this  problem  by  MAXLMATCH.  So  that  the  reader  won’t  be 
kept  in  suspense,  let  us  announce  that  MAXLMATCH  is  now  known  to  be  in  NC  (in  fact, 
in  NC^).  But  just  how  it  got  there  is  an  interesting  story. 
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In  1980,  Lev  [155j  showed  that  for  bipartite  graphs,  MAXLMATCH  was  in  NC'*.  Actu¬ 
ally,  she  gave  three  different  algorithms  for  this  problem.  In  addition — and  this  seems  not 
to  be  widely  realized — she  also  gave  an  NC*  algorithm  for  finding  a  meiximum  matching 
(henceforth  MAXMMATCH)  in  any  regular  bipartite  graph  by  means  of  an  NC**  edge  col¬ 
oring  algorithm  for  this  family.  The  bound  was  improved  to  NC^  in  [156].  The  complexity 
for  general  regular  graphs  remains  open,  although  see  [157]. 

No  processor  count  was  mentioned  by  Lev,  although  Israeli  and  Shiloach  [158]  assess 
her  processor  bound  as  0(m^/ log  m). 

Four  years  later,  Keirp  and  Wigderson  [159,160]  extended  the  result  of  Lev  by  showing 
MAXLMATCH  to  be  in  NC'*  for  all  graphs.  Their  algorithm  employed  m  +  n  processors. 
The  Karp-Wigderson  algorithm  was  actually  a  special  case  of  the  first  NC  maximal  in¬ 
dependent  set  (hereafter  MAXLVP)  algorithm.  We  shall  return  to  their  algorithm  in  the 
next  section. 

In  [158],  Israeli  and  Shiloach  give  another  NC"*  algorithm  for  MAXLMATCH  using  m-f- 
n  processors  which  implemented  on  a  CRCW-PRAM  reduces  to  NC^.  Shortly  thereafter. 
Israeli  and  Itai  [l6l]  found  a  randomized  algorithm  for  MAXLMATCH  which  lowers  the 
polylog  time  exponent  by  2,  thus  placing  the  algorithm  in  RNC^.  (The  reader  should  note 
that  this  is  the  first  use  of  a  randomized  algorithm  in  this  paper,  but  not  the  last!) 

A  side  remairk  is  in  order  at  this  point.  Luby  [162,163]  proved  MAXMMATCH  to 
be  in  NC®  with  0{n)  processors.  Although  this  bound  is  not  as  good  as  that  of  Israeli 
and  Shiloach,  it  deserves  mention  because  it  introduced  a  new  and  sophisticated  technique 
the  full  potential  of  which  has  probably  not  yet  been  realized.  The  procedure  we  refer 
to  formulates  a  randomized  algorithm  (in  this  application  to  both  MAXLMATCH  and  to 
MAXMLVP)  and  then  removes  the  randomness  to  obtain  a  deterministic  algorithm.  Such 
techniques  were  used  in  Karp  and  Wigderson  [159,160]  and  previously  by  Luby  himself 
[164,165]  primarily  in  connection  with  MAXLVP.  (See  the  next  section.)  But  these  older 
procedures  have  the  unpleasant  side  effect  of  producing  a  rather  large  blow  up  in  the 
number  of  processors  required.  In  [162,163],  Luby  develops  a  new  approach  making  clever 
use  of  a  new  probabilistic  space  which  ultimately  permits  removal  of  randomization  with 
no  increase  in  the  number  of  processors  required. 

However,  it  was  in  [164,165]  that  Luby  reduced  the  complexity  of  the  problem  to  NC^, 
again  proceeding  as  did  Karp  and  Wigderson  to  obtain  his  bound  as  a  special  case  of  an 
algorithm  for  MAXLVP.  Again,  we  will  have  more  to  say  about  this  in  the  next  section. 

Now  let  us  turn  to  parallel  algorithms  for  perfect  and  maximum  matching.  To  further 
muddy  the  waters,  in  the  parallel  case,  we  shall  have  to  differentiate  between  the  problems 
of  deciding  if  a  perfect  matching  exists  (hereafter  ?PM)  and  the  search  version  of  the 
problem  (FINDPM).  Remember;  none  of  these  three  problems  is  known  to  be  in  NC. 

There  are  some  intriguing  new  questions  which  £irise  in  parallel  computing  which 
are  not  really  significant  in  the  sequential  situation.  One  of  these  asks:  “What  is  the 
difference  in  complexity  between  decision  problems  and  search  problems?”  The  interested 
reader  may  consult  [166]  for  a  diversion  into  the  land  of  rank  and  independence  oracles 
and  their  relative  power. 

In  1979,  Lovasz  [167]  proposed  a  test  for  TPM  by  giving  a  randomized  procedure  for 
testing  if  a  certain  matrix  is  non-singular.  Borodin,  von  zur  Gathen  and  Hopcroft  [168,169) 
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then  combined  this  with  an  NC^  algorithm  already  developed  by  Csanky  [l70j  for  testing 
matrix  non-singulaxity  to  give  an  RNC^  algorithm  for  ?PM. 

For  the  search  problem  FINDPM,  a  breakthrough  was  obtained  in  1985  by  Karp,  Upfal 
and  Wigderson  [I71,l72j  who  found  the  first  RNC  algorithm.  Their  procedure  actually 
placed  the  problem  in  RNC^.  Shortly  thereafter,  Galil  and  Pan  [173,174]  considerably 
improved  the  processor  bound.  In  1987,  Mulmuley,  Vazirani  and  Vazirani  [175,176]  put 
the  problem  in  RNC^  with  a  faster  algorithm  with  processor  bound  0{n^'^m). 

Now  what  about  MAXMMATCH?  Actually,  Mulmuley,  Vazirani  and  Vazirani  [175, 
176]  showed  that  this  problem  is  also  in  RNC^  as  well  as  the  following  related  search 
problems; 

(a)  Find  a  maximum  weight  perfect  matching  in  a  graph  the  edge  weights  of  which 
are  given  in  unary,  and 

(b)  Find  a  matching  covering  a  set  of  vertices  of  maximum  weight  in  a  graph  where 
the  weights  on  the  vertices  are  given  in  binary. 

Moreover,  they  show  that  if  any  of  these  three  search  problems  is  in  NC,  they  all  are. 
(The  Galil-Pan  improved  processor  bounds  apply  here  too.) 

A  clever  observation  by  Karloff  [177]  can  be  used  to  turn  all  these  algorithms  of  Karp, 
Upfal  and  Wigderson  and  Mulmuley,  Vazirani  and  Vazirani  from  Monte  Carlo  to  Las  Vegas, 
so  all  are  now  known  to  actually  lie  in  “zero-error” class  ZNC^. 

It  is  interesting  that  a  close  relative  to  the  above  three  problems  has  unknown  com¬ 
plexity; 

(c)  Find  a  minimum  weight  perfect  matching  in  a  graph  with  edge  weights  given  in 
binary. 

Before  leaving  these  two  papers,  we  should  also  inform  the  reader  that  Mulmuley, 
Vazirani  and  Vazirani  were  also  able  to  show  that  EXACTMATCH  lies  in  RNC'  as  well. 
Recall  from  Section  3  that  it  is  not  known  if  this  problem  can  be  solved  deterministically 
in  polynomial  time. 

Much  of  the  information  given  in  this  section  down  to  this  point  can  be  found  (and  in 
considerably  greater  detail)  in  the  two  excellent  surveys  by  Galil  [178,179]  which  appeared 
necirly  simultaneously  in  1986. 

We  now  return  to  the  basic  unsolved  problems  motivating  this  section;  that  is;  Are 
any  of  ?PM,  FINDPM  or  MAXMMATCH  in  NC? 

Although  these  problems  remain  open  in  general,  membership  has  been  affirmed  for 
varioiis  graph  classes. 

In  addition  to  regular  bipartite  graphs,  as  discussed  earlier,  (see  [155,156]),  FINDPM 
has  been  shown  to  be  in  NC  for  the  following  graph  families: 

1.  Claw-free  graphs  (and  hence  for  line  graphs  as  well).  Chrobak  and  Naor  [180]  give  an 
NC^  algorithm.  See  also  Naor  [181]. 

2.  Planar  bipartite  graphs.  This  was  done  by  Miller  and  Naor  in  1989  [182].  Note  that 
FINDPM  remains  open  for  pianar  graphs  in  general.  Here  is  an  instance  where,  on  the 
other  hand,  ?PM  is  known  to  be  in  NC!  This  is  an  immediate  corollary  of  the  fact  that 
counting  perfect  matchings  (hencforth  #PM)  is  in  NC^  when  the  graph  is  planar.  We 
shall  deal  with  this  again  in  Section  7  below. 
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3.  Dense  graphs.  A  graph  is  said  to  be  dense  if  mindeg  G  >  n/2.  Dahlhaus  and  Karpinski 
[183]  found  an  NC^  algorithm  requiring  0(n®)  processors  and  somewhat  later,  Dahlhaus, 
Hajnal  and  Karpinski  [184,185]  found  an  NC"*  procedure  using  a  linear  number  of  proces¬ 
sors. 

Note  that  a  dense  graph  (on  an  even  number  of  vertices)  always  hats  a  perfect  matching. 
This  is  an  immediate  corollary  to  the  classical  theorem  of  Dirac  which  says  more,  namely 
that  such  a  graph  must  have  a  Hamilton  cycle. 

A  very  interesting  “negative”  result  is  also  proved  in  these  three  papers.  Namely,  if 
one  drops  the  degree  bound  to  n/2  —  c  for  <iny  c  >  0,  then  the  problem  is  just  as  hard 
(under  NC-reductions)  as  FINDPM  for  general  graphs! 

4.  Strongly  chordal  graphs.  A  graph  G  is  chordal  if  every  cycle  of  length  >  3  has  a  chord. 
This  is  equivalent  to  having  a  perfect  elimination  scheme  <  on  the  vertex  set;  namely,  if 
(u,  u),  (u,  ly)  e  .£^(G),  then  (u.ty)  G  EiG).  Strongly  chordal  graphs  can  be  defined  by 
imposing  the  additional  requirement  that  the  perfect  elimination  scheme  also  satisfies:  if 
X  <  u  and  y  <  v  and  if  (i, y),  (i,  u),  (y,  u)  G  E{G),  then  (u,  v)  £  E{G). 

Strongly  chordal  graphs  axe  in  NC^  by  Dahlhaus  and  Karpinski  [186] .  Although 
FINDPM  is  in  NC^  for  strongly  chordal  graphs,  for  the  wider  class  of  chordal  graphs,  it  is 
as  hard  as  the  general  bipartite  case,  the  parallel  complexity  of  which,  as  we  have  remarked 
several  times  already,  is  imknown. 

5.  Co-comparability  graphs.  A  precedence  graph  is  an  acyclic  transitively  closed  di¬ 
rected  graph.  A  comparability  graph  is  an  undirected  graph  which  can  be  oriented  so 
as  to  become  a  precedence  graph.  Co-comparability  graphs  are  the  complements  of 
comparability  graphs.  Helmbold  and  Mayr  [187,188]  have  produced  an  NC^  algorithm  for 
these  graphs. 

There  is  a  close  relationship  between  co-comparability  graphs  and  the  so-called  two 
processor  scheduling  problem.  As  corollaries,  one  obtains  here  NC^  algorithms  for  FINDPM 
for  the  class  of  permutation  graphs  and  partial  orders  of  dimension  2,  as  well  as  interval 
graphs. 

An  NC  algorithm  for  co-compaxability  graphs  was  independently  found  by  Kozen, 
Vazirani  and  Vazirani  [189],  although  no  polylog  time  exponent  is  given.  This  paper  is 
also  interesting  from  another  point  of  view.  Although  the  membership  of  ?PM  in  NC  is  a 
famous  unsolved  problem,  here  these  three  authors  give  an  NC  algorithm  for  the  closely 
related  problem:  “Does  graph  G  have  a  unique  perfect  matching?”  Moreover,  Rabin  and 
Vazirani  [190]  give  an  NC^  algorithm  for  FINDPM,  in  the  caise  when  G  has  a  unique  perfect 
matching. 

6.  Bipartite  graphs  with  a  polynomially  bounded  number  of  perfect  matchings.  Grigoriev 
and  Karpinski  [191]  give  an  NC^  algorithm  for  this  class.  In  fact,  the  authors  do  more.  For 
such  graphs,  they  show  that  finding  all  perfect  matchings  is  in  NC^.  Moreover,  if  $(G)  is 
bounded  by  a  constant,  they  find  all  perfect  matchings  with  a  faster — NC^ — algorithm. 

An  interesting  sidelight  here  is  the  problem  of  recognizing  such  graphs!  This  is  leading 
us  perilously  close  to  yet  another  important  problem:  counting  the  number  $(G)  of  perfect 
matchings  in  a  graph  G.  This  has  been  shown  to  be  #P-complete,  so  hope  for  a  polynomial 
algorithm,  much  less  an  NC  algorithm,  to  compute  $(G)  is  dim  indeed.  However,  Grigoriev 
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and  Karpinski  do  give  an  RNC^  algorithm  which,  given  any  polynomial  cn'',  will  decide  if 
^(G)  <  cn^. 

For  the  problem  of  ?PM  in  bipartite  graphs  with  a  polynomially  bounded  number  of 
perfect  matchings,  these  two  authors  also  give  aji  NC^  algorithm  in  their  paper.  They  also 
state  that  their  results  can  be  extended  from  bipartite  graphs  to  graphs  in  general,  but  no 
proofs  are  given. 

Let  us  insert  at  this  point  a  few  remarks  on  the  problem  ?PM.  In  the  paper  of  Grigoriev 
and  Kajpinski  discussed  in  the  preceding  three  paragraphs,  one  also  finds  the  result  that 
?PM  belongs  to  NC^  for  graphs  with  a  polynomially  bounded  number  of  perfect  matchings. 
We  will  see  in  the  laist  section  of  our  paper  that  exact  counting  of  perfect  matchings — 
#PM — is  in  NC  for  certain  classes  of  graphs;  e.g.,  planar  graphs.  Thus  by  default,  ?PM 
is  in  NC  for  all  such  classes.  (Recall  that  #PM  is  ^P-complete  and  therefore  NP-hard  in 
general.) 

Now  let  us  return  to  the  maximal  matching  problem — MAXLMATCH — discussed  at 
the  beginning  of  this  section.  As  we  noted  above,  it  is  now  known  to  be  in  NC,  although  not 
without  a  struggle!  Motivated  still  by  the  apparent  difficulty  of  doing  greedy  procedures  in 
parallel,  researchers  have  posed  and  investigated  the  folowing  variant  of  MAXLMATCH. 

Suppose  the  edges  of  a  graph  G  are  numbered  1  through  m.  The  lexicographically 
first  maximal  matching  problem — LFMAXLMATCH — is  stated  as  follows.  Does  the  lex¬ 
icographically  first  maximal  matching  in  G  contain  edge  m?  Is  the  problem  in  NC  or 
perhaps  even  P-complete?  These  are  open  questions.  (See  [190,192,193,194].) 

In  the  latter  three  references,  the  problem  is  one  of  several  used  to  motivate  the 
creation  of  a  new  complexity  class — CC.  (This  will  be  the  last  class  in  our  lattice  diagram 
to  be  discussed.)  This  class  is  discussed  at  length  in  [193,192,194]  with  perhaps  the  most 
succinct  treatment  in  [22]. 

First  we  must  define  the  circuit  value  problem — or  CV:  “Given  a  Boolean  circuit  with 
n  inputs  and  a  single  output,  together  with  a  binary  input  value  for  ecich  input  gate,  is 
the  output  =  1?” 

CV  is  known  to  be  P-complete  under  logspace  reductions.  (See  [l95|.) 

Now  we  modify  the  CV  problem  as  follows.  Suppose  we  restrict  all  circuit  elements 
to  have  precisely  two  inputs  and  two  outputs,  one  output  yielding  x  Ay,  the  other  yielding 
z  V  y  for  binary  inputs  z  and  y.  These  circuit  elements  are  called  comparators  and  we 
have  the  comparator  circuit  value  problem — or  CCV:  “Given  a  Boolean  comparator  circuit 
and  binary  inputs,  is  the  output  =  1?” 

The  class  CC  is  defined  as  the  class  of  problems  log-space  reducible  to  the  CCV 
problem. 

Subramanian  [192]  has  shown  LFMAXLMATCH  to  be  CC-complete.  In  addition,  he 
also  shows  another  old  friend  from  earlier  in  this  paper  to  be  CC-complete:  the  Stable 
Roommates  Problem.  For  a  proof  of  this,  as  well  as  other  CC-complete  variants,  see  [196]. 

There  are  several  interesting  open  problems  surrounding  class  CC.  Is  there  a  machine 
characterization  of  CC?  Are  CC  and  NC  comparable? 

Feder  [87]  has  shown  that  CC  contains  the  class  NL  (non-deterministic  log-space). 
(See  once  again  [22]  for  a  description  of  this  class.) 

It  seems  that  there  are  no  results  on  the  parallel  complexity  or  indeed  any  known 


relationships  to  classes  inside  P  (e.g.,  CC)  for  lexicographically  first  maximum  matching. 

Let  us  insert  here  a  short  remaxk  on  the  parallel  complexity  of  MATROID  PARITY 
introduced  and  discussed  in  Section  3.  Very  recently,  Narayanan,  Saran  and  V.V.  Vazirani 
[197]  have  shown  that  if  the  matroids  are  linear,  there  is  an  RNC^  algorithm  to  solve  the 
problem  which  uses  0(n‘‘  ®)  pvocessors. 

To  close  this  section,  let  mention  a  very  recent  result  on  approximating  a  maximum 
matching  in  parallel.  Fischer,  Goldberg  and  Plotkin  [198]  have  developed  an  NC^  algorithm 
which,  given  a  constant  k  >  0,  computes  a  matching  of  cardinality  at  least  1  —  l/(k  -f  l) 
times  the  maximum.  The  algorithm  requires  processors. 

6.  Vertex  Packing  in  Parallel 

The  problems  considered  here  have  matching  analogs  which  were  discussed  in  the 
preceding  section.  In  particular,  we  will  treat  the  maximal  independent  set  problem 
(MAXLVP)  and  its  lex-first  cousin  LFMAXLVP. 

Let  us  dispose  of  the  latter  problem  first.  LFMAXLVP  was  shown  to  be  P-complete 
under  log-space  reduction  by  Cook  in  1985  [64j.  (See  also  [25,26].) 

On  the  other  hand,  MAXLVP  wzis  first  shown  to  be  in  NC  by  Karp  and  VVigderson 
[159,160]  in  1984.  They  gave  an  NC'*  algorithm  using  0(n^/(log  n)^)  processors.  In  the 
same  paper,  they  also  provided  a  faster  randomized  version,  thus  showing  the  problem  to 
be  in  RNC^.  (This  result  automatically  placed  LFMAXLMATCH  in  the  same  two  classes; 
this  was  discussed  in  Section  5.) 

It  should  be  noted  that  the  Karp-Wigderson  paper  was  a  deep  piece  of  work.  In  par¬ 
ticular,  in  order  to  go  from  the  randomized  version  of  their  algorithm  to  the  deterministic 
version,  they  appealed  to  the  theory  of  block  designs,  no  less! 

Two  independent  improvements  to  NC^  are  due  independently  to  Luby  [164,165]  and 
to  Alon,  Babai  and  Itai  [199],  Both  papers  first  present  a  randomized  version  of  their 
respective  algorithms  and  then  cleverly  remove  randomness. 

In  the  past  five  years  or  so,  several  papers  have  appeared  all  with  their  main  theme 
being  improvement  of  the  Karp-Wigderson  time-processor  product  boimd.  Goldberg  and 
Spencer  first  found  an  NC"*  algorithm  using  0(n)  processors  [200,201]  and  then  produced 
an  NC^  algorithm  requiring  0((n  -t-  m)/logn)  processors  [202].  Preceding  both  these 
results,  Goldberg  had  produced  a  parallel  algorithm  whose  time-processor  product  was 
better  than  Luby  or  Karp  and  Wigderson,  but  the  algorithm  was  not  polylog  time  [203], 

The  Goldberg-Spencer  papers  also  asked  an  interesting  new  question.  Turan  [204] 
proved  in  1954  that  every  graph  with  n  vertices  and  m  edges  must  contain  an  independent 
set  of  size  at  least  n^/(2m  -f-  n).  Can  such  a  set  be  found  via  an  NC  algorithm?  In  [205], 
they  answered  their  own  question  in  the  affirmative,  giving  an  NC^  algorithm. 

Returning  now  to  MAXLVP,  let  us  note  that  a  more  efficient  NC  algorithm  for  the 
special  case  when  G  is  planar  was  found  by  Xin  He  [206]  who  found  an  NC^  routine 
requiring  only  0{n)  processors.  In  1988,  this  result  was  extended  by  Khuller  [207]  to  ^"3,3- 
free  graphs.  Khuller  made  use  of  a  special  decomposition  technique  developed  by  V.V. 
Vazirani  [75,76]  in  his  work  on  $(<S’)  for  Kj  s-free  graphs  which  we  will  meet  again  in  the 
final  section  of  this  paper. 


24 


Last  year,  Dadoun  and  Kirkpatrick  [208]  reduced  the  time  bound  for  planar  graphs 
to  0(log  n  log*  n),  while  preserving  the  0(n)  processor  bound.  (Here  log"  n  denotes  the 
number  of  applications  of  the  log  function  required  to  reduce  n  to  a  constant  value.)  They 
also  gave  an  RNC^  algorithm  for  MAXLVP. 

Now  what  about  the  MAXMVP  problem?  In  other  words,  what  can  be  said  about 
the  (apparently  much  hajder)  task  of  computing  a  maximum  independent  set  in  parallel? 

NC  algorithms  have  been  found  for  interval  graphs  by  Helmbold  and  Mayr  [209]  and 
Bertossi  and  Bonuccelli  [210])  and  more  recently  for  the  more  general  class  of  chordal 
graphs  by  Naor,  Naor  and  Schaffer  [211,212].  To  date,  these  are  the  only  special  classes  of 
graphs  known  to  the  author  for  which  MAXMVP  has  been  shown  to  be  in  NC. 

Finally,  we  also  note  that  an  NC  algorithm  for  listing  all  maximal  independent  sets  in  a 
chordal  graph  is  also  given  in  [211,212].  Dahlhaus  and  Karpinski  [213]  have  now  developed 
NC  algorithms  for  listing  all  maximal  independent  sets  for  other  classes  of  graphs,  all  of 
which  have  polynomially  bounded  numbers  of  maximal  independent  sets.  (Chordal  graphs 
have  this  property,  for  example.) 


7.  Enumeration 

In  this  section,  we  will  give  a  short  summary  of  three  types  of  problems  dealing  with 
enumeration  of  matchings  and  independent  sets; 

1.  Exact  counting 

2.  Approximate  counting 

3.  Complete  listing. 

Let  us  begin  with  counting  perfect  matchings  (#PM)  and  a  sobering  result.  At  this 
point,  the  reader  will  recall  the  complexity  class  #P  from  Section  2.  Valiant  [66,67] 
proved  the  important  (but  somewhat  deflating)  result  that  counting  the  number  of  perfect 
matchings  in  a  bipartite  graph  (i.e.,  determining  ^(G))  is  #P-complete  and  therefore  NP- 
hard.  It  is  important  to  realize  that  here  we  have  a  problem  in  P  (?PM  or  FINDPM)  the 
counting  version  of  which  is  f(^P-hard!  It  is  therefore  not  surprising  that  counting  maximum 
independent  sets  (^(iMAXMVP)  is  also  #P-complete.  So  the  best  we  can  reasonably  hope 
for  in  the  way  of  polynomial  algorithms  for  exact  counting  is  to  be  able  to  count  these 
objects  for  certain  special  classes  of  graphs. 

For  matchings,  Kasteleyn  was  the  pioneer.  Motivated  by  a  question  from  crystal 
physics  (counting  so-called  “dimers”  on  a  crystal  lattice),  he  showed  [214,215]  that  there 
is  a  polynomial  algorithm  to  count  the  number  of  perfect  matchings  in  any  planar  graph. 
His  method  involves  showing  that  any  undirected  planar  graph  can  have  its  edges  oriented 
in  such  a  way  that  a  certain  matrix  associated  with  the  resulting  directed  graph  has  the 
property  that  its  determinant  is  the  square  of  $(G).  Such  an  orientation  has  come  to  be 
called  Pfaffian  after  the  classical  Pfaffian  function  of  a  matrix.  Since  determinants  can 
be  evaluated  in  polynomial  time,  we  have  our  desired  result:  a  polynomial  algorithm  for 
^PM. 
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Subsequently,  Kasteleyn’s  method  was  extended  to  a  wider  class  of  graphs  by  Little 
'2161  who  showed  that  every  iir3,3-free  graph  has  a  Pfaffian  crientation,  although  he  did  not 
deal  with  the  problem  of  finding  a  polynomial  algorithm  to  construct  one.  In  1988,  V.V. 
Vazirani  175,76]  showed  how  to  construct  such  a  polynomial  algorithm.  In  fact,  finding  a 
Pfaffian  orientation  (and  hence  determining  <^(G))  in  a  iifs.s-free  graph  is  even  in  NC  !  In 
the  same  paper,  Vazirani  showed  that  testing  to  see  if  a  graph  is  iii3_3-free  is  also  in  NC. 

Note  that  a  trivial  corollary  of  Vazirani’s  work  shows  now  that  ?PM  for  K^^z-hee 
graphs  is  in  NC.  But  the  nagging  problem  of  FINDPM  remains  open,  even  for  ifs^s-free 
graphs. 

Finally,  Vazirani  showed  that  the  decision  version  of  the  problem  EXACTMATCH, 
introduced  in  Section  3,  also  lies  in  NC,  for  the  family  of  iCs.s-free  graphs.  Again,  we 
emphasize  that  the  search  problem  remains  open. 

But  let  us  return  for  a  moment  to  the  problem  of  finding  a  Pfaffian  orientation  for  a 
graph  in  general.  One  can  pose  three  (at  least  formally)  different  questions  here: 

1.  Does  a  given  graph  G  have  a  Pfaffian  orientation? 

2.  Given  an  orientation  of  graph  G,  is  it  Pfaffian? 

3.  Given  a  graph  G,  find  a  Pfaffian  orientation. 

Observe  that  the  first  two  questions  are  decision  problems,  while  the  third  is  a  search 
problem.  The  complexity  of  all  three  questions  is  currently  unknown.  That  question  2  is 
in  co-NP  follows  from  the  work  of  Kasteleyn  [214,215].  More  recently,  Lovasz  (implicitly) 
[217]  and  V.V.  Vazirani  and  Yannakakis  [218,219]  (explicitly,  using  Lovasz’s  polynomial 
time  algorithm  for  computing  the  GF[2l  rank  of  the  set  of  perfect  matchings  of  a  graph) 
have  shown  that  problems  1  and  2  are  polynomially  equivalent  and  hence  problem  1  is  also 
in  co-NP. 

It  has  been  pointed  out  to  us  by  Pulleyblank  [220]  that  it  also  follows  from  the  work 
of  Lovasz  on  the  matching  lattice  [217]  that  problem  3  is  equivalent  to  problems  1  and  2. 

In  the  case  when  graph  G  is  bipartite,  there  is  an  interesting  connection  between  these 
three  problems  and  a  fourth  problem  which  has  been  studied  by  Seymour  and  Thomassen 
[22li,  among  others. 

4.  Given  a  directed  graph  D,  does  it  fail  to  contain  a  directed  cycle  of  even  length? 

Vazirani  and  Yannakakis  have  shown  [218,219]  that,  for  bipartite  graphs,  problem  4 
is  polynomially  equivalent  to  problems  1  and  2  (and  hence  problem  3,  as  well,  by  Pulley- 
blank's  remark). 

Incidently,  although  #PM  is  in  P  for  planar  graphs  by  Kasteleyn’s  work,  it  has  been 
shown  more  recently  by  Jerrum  [222]  that  counting  all  matchings  in  a  planar  graph  G  is 
#P'Complete.  Going  back  to  crystal  lattices  for  a  moment,  Kasteleyn’s  method  of  course 
applies  only  to  those  which  are  planar.  What  about  counting  dimers  on  3-dimensional 
lattices?  Even  this  is  unknown.  (Cf.  Sinclair  [223].) 

Let  us  also  mention  at  this  point  that  Irving  and  Leather  [224]  proved  in  1986  that 
counting  the  number  of  stable  marriages  (i.e.,  induced  bipartite  matchings)  is  #P-complete. 

We  now  proceed  from  exact  counting  to  approximate  counting.  The  fact  that  exact 
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counting  is  #P-complete  has  lent  impetus  to  trying  for  a  less  ambitious  goal.  Can  we  and 
a  “good  approximation”  for  $(G)  “efficiently”? 

Here  many — but  not  all — the  interesting  results  have  been  obtained  for  the  bipartite 
case.  The  reason  is  that  evaluating  $(G)  for  a  bipartite  graph  G  is  the  same  problem  as 
evaluating  the  permanent  of  a  square  (0-1)  matrix  A.  Let  A  be  any  n  x  n  matrix.  The 
permanent  of  A,  denoted  per  A,  is  just  the  sum  of  all  the  n!  terms  of  its  determinant, 
except  all  terms  axe  taJcen  with  a  plus  sign.  Although  the  permanent  is  therefore  even 
slightly  easier  than  the  determinant  to  define,  it  is  a  much  more  badly  behaved  function! 
(See  Mine  [225]  for  a  general  reference  on  permanents.)  In  particular,  although  polynomial 
algorithms  for  evaluating  the  determinant  are  well-known  to  every  undergraduate  mathe¬ 
matics  student,  the  best  known  algorithm  for  evaluating  the  permanent  has  the  ugly  time 
bound  of  0(n2”)!  (See  Ryser  [226].) 

Fortunately,  there  has  very  recently  appeared  an  excellent  survey  by  Luby  [227]  dealing 
with  approximation  algorithms  for  the  permanent  and  we  heartily  recommend  that  the 
interested  reader  consult  it.  Our  remarks  on  approximation  will  therefore  be  brief. 

An  (e,<5)  approximation  algorithm  for  per  A  is  a  randomized  (Monte  Carlo)  al¬ 
gorithm  which  accepts  an  n  x  n  matrix  A  and  two  positive  real  numbers,  e  and  6.  The 
algorithm  then  outputs  a  number  F  as  an  estimate  for  per  A  in  the  sense  that: 

Prob[{l  —  e)per  A  <Y  <  (1  +  c)per  A]  >  1  —  (5. 

An  (€,5)  approximation  algorithm  is  said  to  be  a  f  :lly-poiynomial  randomized  ap¬ 
proximation  scheme  (fpras)  if  its  runr  '  ig  time  is  polynomial  in  n,l/e  and  1/6. 

It  is  an  open  question  as  to  whether  or  not  there  exists  a  fpras  for  the  permanent 
function,  and  therefore  for  $(0"),  for  G  biparfle. 

Very  recently,  two  major  lines  of  research  on  this  question  have  been  undertaken. 
The  first  of  these  has  resulted  in  an  approximation  algorithm  which  meets  the  accuracy 
demand  of  an  fpras,  but  in  superpolynomial  time.  More  specifically,  Karmarkar,  Karp, 
Lipton,  Lovasz  and  Luby  [228]  have  designed  a  Monte  Carlo  algorithm  which  yields  the 
desired  output  in  time  2”/^(l/c^)  log(l/5)p(n),  where  p(n)  is  a  polynomial  in  n.  For  fixed 
e  and  6  this  is  about  the  square  root  of  the  time  bound  for  Ryser’s  algorithm.  (Even  more 
recently,  Jerrum  and  U.  Vazirani  [229]  have  designed  a  different  algorithm  with  worst-case 
time  complexity  exp(C>(n^/^  log^  n)),  which  improves  that  of  [228].) 

The  five  authors  of  [228]  also  pose  the  following  open  question:  Is  there  a  deterministic 
algorithm  with  running  time  o(2")  which  accepts  as  input  matrbe  A  and  a  positive  real  e 
and  outputs  Y  such  that 


(l  —  e)per  A  <  F  <  (1  +  f)per  A  ? 

The  second  approach  which  seems  to  have  originated  with  an  idea  of  Broder  [230] 
and  been  completed  by  Jerrum  and  Sinclair  [231,232].  The  latter  two  authors  succeeded 
in  finding  a  fpras  for  dense  permanents,  that  is  for  dense  bipartite  graphs.  The  Jerrum- 
Sinclair  papers  (and  several  other  companion  papers)  are  not  only  important  for  this  result, 
but  perhaps  even  more  so  for  introducing  novel  approaches  using  such  esoteric  concepts 
from  probability  theory  as  rapidly  mixing  Markov  chains  and  conductance.  (Broder  too 
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deals  with  slightly  modified  versions  of  the  same.)  In  general,  their  common  approach 
deals  with  reducing  the  problem  of  approximately  counting  perfect  matchings  to  that  of 
generating  them  at  random  from  an  almost  uniform  distribution. 

Using  these  ideas,  Dagum,  Luby,  Mihail  and  U.V.  Vazirani  [233,234]  (see  also  Dagum 
and  Luby  [235])  have  achieved  a  polynomial  speed  up  of  the  algorithm  of  Jerrum  and 
Sinclair  and  used  it  to  show  that  there  is  also  a  fpras  for  bipartite  graphs  with  large  factor 
size.  The  factor  size  of  a  bipartite  graph  G  =  A  U  B  (where  |.4.|  =  '  9|  =  n)  is  the 
maximum  number  of  edge-disjoint  perfect  matchings  in  G.  (Observe  that  a  graph  with  an 
an-factor  must  have  minimum  degree  at  least  an,  but  not  necessarily  vice-versa.)  In  this 
area,  Dagum  and  Luby  have  shown  that  there  is  a  fpras  for  bipartite  graphs  with  factor 
size  at  least  an  for  any  constant  a  >  0. 

These  results  are  even  more  interesting  when  compared  to  some  related  new  com¬ 
pleteness  results.  Broder  [230]  has  shown  that  exact  counting  in  dense  graphs  is  as  hard  as 
exacting  counting  in  general  and  so  is  ^P-complete.  Dagum  and  Luby  [235]  and  Dagum, 
Luby,  Mihail  and  U.V.  Vazirani  [233,234]  have  shown  that  exact  counting  in  /(n)-regular 
bipartite  graphs  is  ?^P-complete  for  any  f{n)  such  that  3  <  f{n)  <  n  —  3.  In  fact,  they 
show  that  for  any  €  >  0  and  for  any  function  f(n)  such  that  3  <  /(^)  <  the  exis¬ 

tence  of  a  fpras  for  /(n)-regular  bipartite  graphs  would  imply  the  existence  of  a  fpras  for 
all  bipartite  graphs! 

And  finally  a  word  or  two  about  finding  (i.e.,  listing)  all  matchings  and  independent 
sets.  Indeed  the  literature  is  quite  sparse  for  these  problems. 

For  bipartite  graphs,  Grigoriev  and  Karpinski  [191]  have  developed  algorithms  for 
constructing  all  maximum  matchings  in  the  case  when  $(G)  is  bounded.  More  particularly, 
they  show  t(^PM  is  in  NC^  when  $(C?)  is  bounded  by  a  constant  and  in  NC^  when  $(G)  is 
polynomially  bounded  (i.e.,  $(G)  <  cn^).  They  claim  their  results  extend  to  non-bipartite 
graphs  as  well,  although  they  give  no  proofs. 

For  bipartite  graphs  in  general,  it  appears  that  the  best  algorithm  for  finding  all 
perfect  matchings  is  due  to  Fukuda  and  Matsui  [236]  who  found  an  0((^(G)  -h  l)(n  -h  m  -i- 
n^-®))  (sequential)  algorithm  for  the  problem.  We  are  unaware  of  any  published  parallel 
algorithms  in  this  case. 

For  independent  set  listing  problems,  even  less  seems  to  be  known.  The  sole  result  we 
will  mention  is  the  following.  Let  Af  denote  the  number  of  maxima/  independent  sets  in 
a  graph.  The  fastest  sequential  algorithm  for  constructing  all  maximal  independent  sets 
in  the  graph  has  0{nmM)  time  and  0{n  -|-  m)  space.  It  is  due  independently  to  Chiba 
and  Nishizeki  [237]  and  Tsukiyama,  Ide,  Ariyoshi  and  Shirakawa  [238].  Of  course,  this  is 
a  polynomial  algorithm  if  M  is  polynomially  bounded. 

More  recently,  Dahlhaus  and  Karpinski  [213]  have  obtained  a  parallel  algorithm  for 
this  problem  which  rims  in  C>(log^(nA/))  time  and  uses  0{M^n^)  processors.  Note  that  if 
iVf  is  polynomially  bounded,  this  becomes  an  NC  algorithm.  There  are  many  interesting 
classes  of  graphs  with  bounded  M  and  the  authors  give  a  list  of  eight  such  families. 


8.  Lower  Bounds 

Note  that  all  of  the  considerable  work  discussed  and/or  referred  to  above  in  this 
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paper  possesses  one  common  thread:  Can  we  find  a  faster  algorithm  to  solve  the  problem 
at  hand? 

Much  less  progress  has  been  made  on  approaches  made  from  the  opposite  direction: 
Can  we  prove  that  no  polynomial  time  algorithm  is  possible  for  a  given  problem?  Or, 
more  generally,  can  we  prove  that  no  algorithm  is  possible  in  a  given  time  or  space  for  a 
given  problem? 

Though  results  of  this  kind — that  is,  finding  lower  bounds  for  the  computational 
complexity  of  certain  problems — are  much  more  sparse,  we  will  close  this  paper  with  a 
brief  overview  of  the  work  of  A.  A.  Razborov  in  this  area.  His  work  has  caused  considerable 
excitement  among  complexity  theorists  world-wide  and  has  won  him  the  Nevanlinna  Prize 
at  the  International  Congress  of  Mathematicians  held  in  Kyoto  in  1990.  For  our  brief 
synopsis,  we  borrow  heavily  from  the  articles  of  Lovasz  [239]  and  Sipser  [45]. 

This  work  is  cast  in  the  terminology  of  boolean  circuit  theory.  (We  mentioned  boolean 
circuits  briefly  in  our  treatment  of  the  complexity  class  CC  in  Section  5.)  A  boolean 
circuit  for  a  computation  is  an  acyclic  directed  graph  the  nodes  of  which  represent  the 
elementary  steps  in  the  computation.  These  nodes  are  also  called  gates.  Gates  having 
indegree  0  are  called  mput  gates  and  those  with  outdegree  0  are  output  gates.  The  simplest 
kinds  of  gates  are  AND,  OR  and  NOT  gates.  The  size  of  a  boolean  circuit  is  the  number  of 
gates  in  it.  Every  polynomial  time  computable  fimction  can  be  computed  by  a  polynomial 
size  boolean  circuit.  More  precisely  this  means  that  if  Z,  is  a  language  in  P  and  Ln  denotes 
the  set  of  strings  of  length  n  in  L,  then  there  exists  a  family  Bn  of  boolean  circuits  such 
that  Bn  takes  n  input  bits  and  recognizes  Ln  (i.e.,  outputs  1  if  and  only  if  the  n-bit  input 
string  is  in  Ln)',  and  the  size  of  Bn  is  bounded  by  n'  for  some  fixed  constant  c. 

So  if  we  could  prove  that  a  superpolynomial  lower  bound  exists  on  the  circuit  size 
required  by  some  NP  problem  such  as  the  existence  of  a  clique  of  given  size,  then  this 
would  suffice  to  separate  NP  from  P. 

Now  let  us  define  a  boolean  circuit  to  be  monotone  if  it  contains  no  NOT  gates. 
Razborov,  in  two  1985  papers,  showed  that  neither  (a)  [240]  deciding  the  existence  of  a 
clique  of  given  size  in  a  graph  nor  (b)  [241]  deciding  the  existence  of  a  perfect  matching 
in  a  bipartite  graph  (a  special  case  of  (a))  can  be  done  using  polynomial-size  monotone 
circuits.  In  particular,  he  showed  that  at  least  gates  are  necessary  for  a  monotone 

circuit  solution  of  perfect  matching.  So  ?PM,  which  is  well-known  to  have  polynomial  non¬ 
monotone  complexity,  has  super-polynomial  monotone  complexity.  Alon  and  Boppana  [242] 
have  since  strengthened  Razborov’s  clique  result  to  show  that  in  fact  there  Is  an  exponential 
lower  bound  for  the  clique  problem  and  not  just  a  superpolynomial  bound. 

The  clique  problem  is  clearly  equivalent  to  the  independent  set  problem  via  graph  com¬ 
plementation.  Hence  Razborov’s  results  deal  precisely  with  matching  and  vertex  packing, 
our  two  paradigm  problems.  Will  monotone  circuits  somehow  prove  to  yield  the  key  to 
the  apparent  separation  in  the  complexity  of  these  two  problems  and  perhaps  even  provide 
the  answer  to  the  P  =  NP  question?  Only  time  will  tell. 

This  brings  us  to  the  end  of  our  survey.  As  is  always  the  case,  more  could  have 
been  included.  There  are  several  important  topics  related  to  the  theme  of  this  paper, 
but  which  we  have  elected  not  to  include.  Among  these  are  weighted  and  capacitated 
matching.  For  useful  references,  see  [243,244,245,246,247,248].  For  probabilistic  analysis 
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of  graph  algorithms,  see  [249,250,251,252,253].  And  for  matching  and  vertex  packing  in 
random  graphs,  see  [254,255,256,257,258,259,260,261,262]. 
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